Vui Seng Chua commited on
Commit
31bfa4b
·
1 Parent(s): 0375589

Add content

Browse files
.gitattributes CHANGED
@@ -32,3 +32,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
32
  *.zip filter=lfs diff=lfs merge=lfs -text
33
  *.zst filter=lfs diff=lfs merge=lfs -text
34
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
32
  *.zip filter=lfs diff=lfs merge=lfs -text
33
  *.zst filter=lfs diff=lfs merge=lfs -text
34
  *tfevents* filter=lfs diff=lfs merge=lfs -text
35
+ openvino_model.xml filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,113 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - image-classification
5
+ - vision
6
+ - generated_from_trainer
7
+ datasets:
8
+ - food101
9
+ metrics:
10
+ - accuracy
11
+ model-index:
12
+ - name: jpqd-swin-b-15eph-r1.00-s2e5-mock-main-merge-pr2
13
+ results:
14
+ - task:
15
+ name: Image Classification
16
+ type: image-classification
17
+ dataset:
18
+ name: food101
19
+ type: food101
20
+ config: default
21
+ split: validation
22
+ args: default
23
+ metrics:
24
+ - name: Accuracy
25
+ type: accuracy
26
+ value: 0.9144158415841585
27
+ ---
28
+
29
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
30
+ should probably proofread and complete it, then remove this comment. -->
31
+
32
+ # jpqd-swin-b-15eph-r1.00-s2e5-mock-main-merge-pr2
33
+
34
+ This model is a fine-tuned version of [microsoft/swin-base-patch4-window7-224](https://huggingface.co/microsoft/swin-base-patch4-window7-224) on the food101 dataset.
35
+ It achieves the following results on the evaluation set:
36
+ - Loss: 0.2970
37
+ - Accuracy: 0.9144
38
+
39
+ ## Model description
40
+
41
+ More information needed
42
+
43
+ ## Intended uses & limitations
44
+
45
+ More information needed
46
+
47
+ ## Training and evaluation data
48
+
49
+ More information needed
50
+
51
+ ## Training procedure
52
+
53
+ ### Training hyperparameters
54
+
55
+ The following hyperparameters were used during training:
56
+ - learning_rate: 5e-05
57
+ - train_batch_size: 16
58
+ - eval_batch_size: 128
59
+ - seed: 42
60
+ - gradient_accumulation_steps: 4
61
+ - total_train_batch_size: 64
62
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
63
+ - lr_scheduler_type: linear
64
+ - lr_scheduler_warmup_ratio: 0.1
65
+ - num_epochs: 15.0
66
+
67
+ ### Training results
68
+
69
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
70
+ |:-------------:|:-----:|:-----:|:---------------:|:--------:|
71
+ | 3.8787 | 0.42 | 500 | 3.9971 | 0.7163 |
72
+ | 0.8429 | 0.84 | 1000 | 0.6450 | 0.8678 |
73
+ | 0.8561 | 1.27 | 1500 | 0.4160 | 0.8945 |
74
+ | 0.5777 | 1.69 | 2000 | 0.3664 | 0.9006 |
75
+ | 12.3601 | 2.11 | 2500 | 12.0328 | 0.9023 |
76
+ | 49.0606 | 2.54 | 3000 | 48.5000 | 0.8526 |
77
+ | 75.3173 | 2.96 | 3500 | 75.5341 | 0.6942 |
78
+ | 93.6153 | 3.38 | 4000 | 93.3091 | 0.5929 |
79
+ | 103.5744 | 3.8 | 4500 | 103.1211 | 0.5846 |
80
+ | 107.7701 | 4.23 | 5000 | 108.0755 | 0.5398 |
81
+ | 109.5736 | 4.65 | 5500 | 108.7624 | 0.5855 |
82
+ | 1.8028 | 5.07 | 6000 | 1.0960 | 0.8179 |
83
+ | 1.2549 | 5.49 | 6500 | 0.6560 | 0.8695 |
84
+ | 0.7199 | 5.92 | 7000 | 0.5619 | 0.8769 |
85
+ | 0.8874 | 6.34 | 7500 | 0.5151 | 0.8859 |
86
+ | 0.7429 | 6.76 | 8000 | 0.4830 | 0.8898 |
87
+ | 0.6759 | 7.19 | 8500 | 0.4681 | 0.8926 |
88
+ | 0.5352 | 7.61 | 9000 | 0.4360 | 0.8956 |
89
+ | 0.6021 | 8.03 | 9500 | 0.4202 | 0.8979 |
90
+ | 0.5617 | 8.45 | 10000 | 0.3940 | 0.9003 |
91
+ | 0.7235 | 8.88 | 10500 | 0.3915 | 0.9000 |
92
+ | 0.5323 | 9.3 | 11000 | 0.3793 | 0.9017 |
93
+ | 0.589 | 9.72 | 11500 | 0.3670 | 0.9051 |
94
+ | 0.425 | 10.14 | 12000 | 0.3615 | 0.9059 |
95
+ | 0.7103 | 10.57 | 12500 | 0.3479 | 0.9070 |
96
+ | 0.6251 | 10.99 | 13000 | 0.3472 | 0.9073 |
97
+ | 0.623 | 11.41 | 13500 | 0.3353 | 0.9088 |
98
+ | 0.6012 | 11.83 | 14000 | 0.3292 | 0.9098 |
99
+ | 0.4984 | 12.26 | 14500 | 0.3230 | 0.9112 |
100
+ | 0.4763 | 12.68 | 15000 | 0.3158 | 0.9109 |
101
+ | 0.3209 | 13.1 | 15500 | 0.3120 | 0.9123 |
102
+ | 0.4854 | 13.52 | 16000 | 0.3057 | 0.9126 |
103
+ | 0.5472 | 13.95 | 16500 | 0.3032 | 0.9134 |
104
+ | 0.3264 | 14.37 | 17000 | 0.3013 | 0.9134 |
105
+ | 0.4136 | 14.79 | 17500 | 0.2977 | 0.9141 |
106
+
107
+
108
+ ### Framework versions
109
+
110
+ - Transformers 4.26.1
111
+ - Pytorch 1.13.1+cu117
112
+ - Datasets 2.10.1
113
+ - Tokenizers 0.13.2
all_results.json ADDED
@@ -0,0 +1,12 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "epoch": 15.0,
3
+ "eval_accuracy": 0.9144158415841585,
4
+ "eval_loss": 0.29701292514801025,
5
+ "eval_runtime": 205.742,
6
+ "eval_samples_per_second": 122.727,
7
+ "eval_steps_per_second": 0.962,
8
+ "train_loss": 17.32710409305085,
9
+ "train_runtime": 64003.0137,
10
+ "train_samples_per_second": 17.753,
11
+ "train_steps_per_second": 0.277
12
+ }
compressed_graph.dot ADDED
The diff for this file is too large to render. See raw diff
 
config.json ADDED
@@ -0,0 +1,255 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "microsoft/swin-base-patch4-window7-224",
3
+ "architectures": [
4
+ "NNCFNetwork"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.0,
7
+ "depths": [
8
+ 2,
9
+ 2,
10
+ 18,
11
+ 2
12
+ ],
13
+ "drop_path_rate": 0.1,
14
+ "embed_dim": 128,
15
+ "encoder_stride": 32,
16
+ "finetuning_task": "image-classification",
17
+ "hidden_act": "gelu",
18
+ "hidden_dropout_prob": 0.0,
19
+ "hidden_size": 1024,
20
+ "id2label": {
21
+ "0": "apple_pie",
22
+ "1": "baby_back_ribs",
23
+ "10": "bruschetta",
24
+ "100": "waffles",
25
+ "11": "caesar_salad",
26
+ "12": "cannoli",
27
+ "13": "caprese_salad",
28
+ "14": "carrot_cake",
29
+ "15": "ceviche",
30
+ "16": "cheesecake",
31
+ "17": "cheese_plate",
32
+ "18": "chicken_curry",
33
+ "19": "chicken_quesadilla",
34
+ "2": "baklava",
35
+ "20": "chicken_wings",
36
+ "21": "chocolate_cake",
37
+ "22": "chocolate_mousse",
38
+ "23": "churros",
39
+ "24": "clam_chowder",
40
+ "25": "club_sandwich",
41
+ "26": "crab_cakes",
42
+ "27": "creme_brulee",
43
+ "28": "croque_madame",
44
+ "29": "cup_cakes",
45
+ "3": "beef_carpaccio",
46
+ "30": "deviled_eggs",
47
+ "31": "donuts",
48
+ "32": "dumplings",
49
+ "33": "edamame",
50
+ "34": "eggs_benedict",
51
+ "35": "escargots",
52
+ "36": "falafel",
53
+ "37": "filet_mignon",
54
+ "38": "fish_and_chips",
55
+ "39": "foie_gras",
56
+ "4": "beef_tartare",
57
+ "40": "french_fries",
58
+ "41": "french_onion_soup",
59
+ "42": "french_toast",
60
+ "43": "fried_calamari",
61
+ "44": "fried_rice",
62
+ "45": "frozen_yogurt",
63
+ "46": "garlic_bread",
64
+ "47": "gnocchi",
65
+ "48": "greek_salad",
66
+ "49": "grilled_cheese_sandwich",
67
+ "5": "beet_salad",
68
+ "50": "grilled_salmon",
69
+ "51": "guacamole",
70
+ "52": "gyoza",
71
+ "53": "hamburger",
72
+ "54": "hot_and_sour_soup",
73
+ "55": "hot_dog",
74
+ "56": "huevos_rancheros",
75
+ "57": "hummus",
76
+ "58": "ice_cream",
77
+ "59": "lasagna",
78
+ "6": "beignets",
79
+ "60": "lobster_bisque",
80
+ "61": "lobster_roll_sandwich",
81
+ "62": "macaroni_and_cheese",
82
+ "63": "macarons",
83
+ "64": "miso_soup",
84
+ "65": "mussels",
85
+ "66": "nachos",
86
+ "67": "omelette",
87
+ "68": "onion_rings",
88
+ "69": "oysters",
89
+ "7": "bibimbap",
90
+ "70": "pad_thai",
91
+ "71": "paella",
92
+ "72": "pancakes",
93
+ "73": "panna_cotta",
94
+ "74": "peking_duck",
95
+ "75": "pho",
96
+ "76": "pizza",
97
+ "77": "pork_chop",
98
+ "78": "poutine",
99
+ "79": "prime_rib",
100
+ "8": "bread_pudding",
101
+ "80": "pulled_pork_sandwich",
102
+ "81": "ramen",
103
+ "82": "ravioli",
104
+ "83": "red_velvet_cake",
105
+ "84": "risotto",
106
+ "85": "samosa",
107
+ "86": "sashimi",
108
+ "87": "scallops",
109
+ "88": "seaweed_salad",
110
+ "89": "shrimp_and_grits",
111
+ "9": "breakfast_burrito",
112
+ "90": "spaghetti_bolognese",
113
+ "91": "spaghetti_carbonara",
114
+ "92": "spring_rolls",
115
+ "93": "steak",
116
+ "94": "strawberry_shortcake",
117
+ "95": "sushi",
118
+ "96": "tacos",
119
+ "97": "takoyaki",
120
+ "98": "tiramisu",
121
+ "99": "tuna_tartare"
122
+ },
123
+ "image_size": 224,
124
+ "initializer_range": 0.02,
125
+ "label2id": {
126
+ "apple_pie": "0",
127
+ "baby_back_ribs": "1",
128
+ "baklava": "2",
129
+ "beef_carpaccio": "3",
130
+ "beef_tartare": "4",
131
+ "beet_salad": "5",
132
+ "beignets": "6",
133
+ "bibimbap": "7",
134
+ "bread_pudding": "8",
135
+ "breakfast_burrito": "9",
136
+ "bruschetta": "10",
137
+ "caesar_salad": "11",
138
+ "cannoli": "12",
139
+ "caprese_salad": "13",
140
+ "carrot_cake": "14",
141
+ "ceviche": "15",
142
+ "cheese_plate": "17",
143
+ "cheesecake": "16",
144
+ "chicken_curry": "18",
145
+ "chicken_quesadilla": "19",
146
+ "chicken_wings": "20",
147
+ "chocolate_cake": "21",
148
+ "chocolate_mousse": "22",
149
+ "churros": "23",
150
+ "clam_chowder": "24",
151
+ "club_sandwich": "25",
152
+ "crab_cakes": "26",
153
+ "creme_brulee": "27",
154
+ "croque_madame": "28",
155
+ "cup_cakes": "29",
156
+ "deviled_eggs": "30",
157
+ "donuts": "31",
158
+ "dumplings": "32",
159
+ "edamame": "33",
160
+ "eggs_benedict": "34",
161
+ "escargots": "35",
162
+ "falafel": "36",
163
+ "filet_mignon": "37",
164
+ "fish_and_chips": "38",
165
+ "foie_gras": "39",
166
+ "french_fries": "40",
167
+ "french_onion_soup": "41",
168
+ "french_toast": "42",
169
+ "fried_calamari": "43",
170
+ "fried_rice": "44",
171
+ "frozen_yogurt": "45",
172
+ "garlic_bread": "46",
173
+ "gnocchi": "47",
174
+ "greek_salad": "48",
175
+ "grilled_cheese_sandwich": "49",
176
+ "grilled_salmon": "50",
177
+ "guacamole": "51",
178
+ "gyoza": "52",
179
+ "hamburger": "53",
180
+ "hot_and_sour_soup": "54",
181
+ "hot_dog": "55",
182
+ "huevos_rancheros": "56",
183
+ "hummus": "57",
184
+ "ice_cream": "58",
185
+ "lasagna": "59",
186
+ "lobster_bisque": "60",
187
+ "lobster_roll_sandwich": "61",
188
+ "macaroni_and_cheese": "62",
189
+ "macarons": "63",
190
+ "miso_soup": "64",
191
+ "mussels": "65",
192
+ "nachos": "66",
193
+ "omelette": "67",
194
+ "onion_rings": "68",
195
+ "oysters": "69",
196
+ "pad_thai": "70",
197
+ "paella": "71",
198
+ "pancakes": "72",
199
+ "panna_cotta": "73",
200
+ "peking_duck": "74",
201
+ "pho": "75",
202
+ "pizza": "76",
203
+ "pork_chop": "77",
204
+ "poutine": "78",
205
+ "prime_rib": "79",
206
+ "pulled_pork_sandwich": "80",
207
+ "ramen": "81",
208
+ "ravioli": "82",
209
+ "red_velvet_cake": "83",
210
+ "risotto": "84",
211
+ "samosa": "85",
212
+ "sashimi": "86",
213
+ "scallops": "87",
214
+ "seaweed_salad": "88",
215
+ "shrimp_and_grits": "89",
216
+ "spaghetti_bolognese": "90",
217
+ "spaghetti_carbonara": "91",
218
+ "spring_rolls": "92",
219
+ "steak": "93",
220
+ "strawberry_shortcake": "94",
221
+ "sushi": "95",
222
+ "tacos": "96",
223
+ "takoyaki": "97",
224
+ "tiramisu": "98",
225
+ "tuna_tartare": "99",
226
+ "waffles": "100"
227
+ },
228
+ "layer_norm_eps": 1e-05,
229
+ "mlp_ratio": 4.0,
230
+ "model_type": "swin",
231
+ "num_channels": 3,
232
+ "num_heads": [
233
+ 4,
234
+ 8,
235
+ 16,
236
+ 32
237
+ ],
238
+ "num_layers": 4,
239
+ "out_features": null,
240
+ "patch_size": 4,
241
+ "path_norm": true,
242
+ "problem_type": "single_label_classification",
243
+ "qkv_bias": true,
244
+ "stage_names": [
245
+ "stem",
246
+ "stage1",
247
+ "stage2",
248
+ "stage3",
249
+ "stage4"
250
+ ],
251
+ "torch_dtype": "float32",
252
+ "transformers_version": "4.26.1",
253
+ "use_absolute_embeddings": false,
254
+ "window_size": 7
255
+ }
eval_results.json ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "epoch": 15.0,
3
+ "eval_accuracy": 0.9144158415841585,
4
+ "eval_loss": 0.29701292514801025,
5
+ "eval_runtime": 205.742,
6
+ "eval_samples_per_second": 122.727,
7
+ "eval_steps_per_second": 0.962
8
+ }
nncf_output.log ADDED
The diff for this file is too large to render. See raw diff
 
openvino_config.json ADDED
@@ -0,0 +1,86 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "compression": [
3
+ {
4
+ "algorithm": "movement_sparsity",
5
+ "ignored_scopes": [
6
+ "{re}.*PatchEmbed.*",
7
+ "{re}.*PatchMerging.*",
8
+ "{re}.*classifier.*",
9
+ "{re}.*LayerNorm.*"
10
+ ],
11
+ "params": {
12
+ "enable_structured_masking": true,
13
+ "importance_regularization_factor": 1.0,
14
+ "warmup_end_epoch": 5,
15
+ "warmup_start_epoch": 2
16
+ },
17
+ "sparse_structure_by_scopes": [
18
+ {
19
+ "mode": "block",
20
+ "sparse_factors": [
21
+ 16,
22
+ 16
23
+ ],
24
+ "target_scopes": "{re}.*SwinAttention.*"
25
+ },
26
+ {
27
+ "axis": 0,
28
+ "mode": "per_dim",
29
+ "target_scopes": "{re}.*SwinIntermediate.*"
30
+ },
31
+ {
32
+ "axis": 1,
33
+ "mode": "per_dim",
34
+ "target_scopes": "{re}.*SwinOutput.*"
35
+ }
36
+ ]
37
+ },
38
+ {
39
+ "algorithm": "quantization",
40
+ "export_to_onnx_standard_ops": false,
41
+ "ignored_scopes": [
42
+ "{re}.*__add___[0-1]",
43
+ "{re}.*layer_norm_0",
44
+ "{re}.*matmul_1",
45
+ "{re}.*__truediv__*"
46
+ ],
47
+ "initializer": {
48
+ "batchnorm_adaptation": {
49
+ "num_bn_adaptation_samples": 200
50
+ },
51
+ "range": {
52
+ "num_init_samples": 32,
53
+ "params": {
54
+ "max_percentile": 99.99,
55
+ "min_percentile": 0.01
56
+ },
57
+ "type": "percentile"
58
+ }
59
+ },
60
+ "overflow_fix": "enable",
61
+ "preset": "mixed",
62
+ "scope_overrides": {
63
+ "activations": {
64
+ "{re}.*matmul_0": {
65
+ "mode": "symmetric"
66
+ }
67
+ }
68
+ }
69
+ }
70
+ ],
71
+ "input_info": [
72
+ {
73
+ "keyword": "pixel_values",
74
+ "sample_size": [
75
+ 16,
76
+ 3,
77
+ 224,
78
+ 224
79
+ ],
80
+ "type": "float"
81
+ }
82
+ ],
83
+ "optimum_version": "1.7.0",
84
+ "save_onnx_model": false,
85
+ "transformers_version": "4.26.1"
86
+ }
openvino_model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2bd7b9fca63d9cc55062f113be7fc9e0f09198c46210121ca552c028c6be09ba
3
+ size 53243008
openvino_model.xml ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1d102968cdc867d532e4cc6cc5942b70ca164644f7dd9b89c374748f84fbf2d4
3
+ size 10499106
original_graph.dot ADDED
The diff for this file is too large to render. See raw diff
 
preprocessor_config.json ADDED
@@ -0,0 +1,23 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "do_normalize": true,
3
+ "do_rescale": true,
4
+ "do_resize": true,
5
+ "feature_extractor_type": "ViTFeatureExtractor",
6
+ "image_mean": [
7
+ 0.485,
8
+ 0.456,
9
+ 0.406
10
+ ],
11
+ "image_processor_type": "ViTFeatureExtractor",
12
+ "image_std": [
13
+ 0.229,
14
+ 0.224,
15
+ 0.225
16
+ ],
17
+ "resample": 3,
18
+ "rescale_factor": 0.00392156862745098,
19
+ "size": {
20
+ "height": 224,
21
+ "width": 224
22
+ }
23
+ }
pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d68205879c144dac9178da1ed9a424a9de9f3a8b266accd009dddd254345b347
3
+ size 685689463
structured_sparsity.csv ADDED
@@ -0,0 +1,145 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ,group_id,type,torch_module,weight_shape,pruned_weight_shape,bias_shape,pruned_bias_shape,head_or_channel_id_to_keep,module_node_name
2
+ 0,0,MHSA,nncf_module.swin.encoder.layers.0.blocks.0.attention.self.query,"(128, 128)","(64, 128)","(128,)","(64,)","[1, 3]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[0]/ModuleList[blocks]/SwinLayer[0]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
3
+ 1,0,MHSA,nncf_module.swin.encoder.layers.0.blocks.0.attention.self.key,"(128, 128)","(64, 128)","(128,)","(64,)","[1, 3]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[0]/ModuleList[blocks]/SwinLayer[0]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
4
+ 2,0,MHSA,nncf_module.swin.encoder.layers.0.blocks.0.attention.self.value,"(128, 128)","(64, 128)","(128,)","(64,)","[1, 3]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[0]/ModuleList[blocks]/SwinLayer[0]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
5
+ 3,0,MHSA,nncf_module.swin.encoder.layers.0.blocks.0.attention.output.dense,"(128, 128)","(128, 64)","(128,)","(128,)","[1, 3]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[0]/ModuleList[blocks]/SwinLayer[0]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
6
+ 4,1,FF,nncf_module.swin.encoder.layers.0.blocks.0.intermediate.dense,"(512, 128)","(306, 128)","(512,)","(306,)",[306 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[0]/ModuleList[blocks]/SwinLayer[0]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
7
+ 5,1,FF,nncf_module.swin.encoder.layers.0.blocks.0.output.dense,"(128, 512)","(128, 306)","(128,)","(128,)",[306 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[0]/ModuleList[blocks]/SwinLayer[0]/SwinOutput[output]/NNCFLinear[dense]/linear_0
8
+ 6,2,MHSA,nncf_module.swin.encoder.layers.0.blocks.1.attention.self.query,"(128, 128)","(32, 128)","(128,)","(32,)",[3],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[0]/ModuleList[blocks]/SwinLayer[1]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
9
+ 7,2,MHSA,nncf_module.swin.encoder.layers.0.blocks.1.attention.self.key,"(128, 128)","(32, 128)","(128,)","(32,)",[3],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[0]/ModuleList[blocks]/SwinLayer[1]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
10
+ 8,2,MHSA,nncf_module.swin.encoder.layers.0.blocks.1.attention.self.value,"(128, 128)","(32, 128)","(128,)","(32,)",[3],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[0]/ModuleList[blocks]/SwinLayer[1]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
11
+ 9,2,MHSA,nncf_module.swin.encoder.layers.0.blocks.1.attention.output.dense,"(128, 128)","(128, 32)","(128,)","(128,)",[3],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[0]/ModuleList[blocks]/SwinLayer[1]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
12
+ 10,3,FF,nncf_module.swin.encoder.layers.0.blocks.1.intermediate.dense,"(512, 128)","(404, 128)","(512,)","(404,)",[404 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[0]/ModuleList[blocks]/SwinLayer[1]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
13
+ 11,3,FF,nncf_module.swin.encoder.layers.0.blocks.1.output.dense,"(128, 512)","(128, 404)","(128,)","(128,)",[404 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[0]/ModuleList[blocks]/SwinLayer[1]/SwinOutput[output]/NNCFLinear[dense]/linear_0
14
+ 12,4,MHSA,nncf_module.swin.encoder.layers.1.blocks.0.attention.self.query,"(256, 256)","(96, 256)","(256,)","(96,)","[1, 3, 5]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[1]/ModuleList[blocks]/SwinLayer[0]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
15
+ 13,4,MHSA,nncf_module.swin.encoder.layers.1.blocks.0.attention.self.key,"(256, 256)","(96, 256)","(256,)","(96,)","[1, 3, 5]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[1]/ModuleList[blocks]/SwinLayer[0]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
16
+ 14,4,MHSA,nncf_module.swin.encoder.layers.1.blocks.0.attention.self.value,"(256, 256)","(96, 256)","(256,)","(96,)","[1, 3, 5]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[1]/ModuleList[blocks]/SwinLayer[0]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
17
+ 15,4,MHSA,nncf_module.swin.encoder.layers.1.blocks.0.attention.output.dense,"(256, 256)","(256, 96)","(256,)","(256,)","[1, 3, 5]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[1]/ModuleList[blocks]/SwinLayer[0]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
18
+ 16,5,FF,nncf_module.swin.encoder.layers.1.blocks.0.intermediate.dense,"(1024, 256)","(782, 256)","(1024,)","(782,)",[782 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[1]/ModuleList[blocks]/SwinLayer[0]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
19
+ 17,5,FF,nncf_module.swin.encoder.layers.1.blocks.0.output.dense,"(256, 1024)","(256, 782)","(256,)","(256,)",[782 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[1]/ModuleList[blocks]/SwinLayer[0]/SwinOutput[output]/NNCFLinear[dense]/linear_0
20
+ 18,6,MHSA,nncf_module.swin.encoder.layers.1.blocks.1.attention.self.query,"(256, 256)","(96, 256)","(256,)","(96,)","[0, 1, 5]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[1]/ModuleList[blocks]/SwinLayer[1]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
21
+ 19,6,MHSA,nncf_module.swin.encoder.layers.1.blocks.1.attention.self.key,"(256, 256)","(96, 256)","(256,)","(96,)","[0, 1, 5]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[1]/ModuleList[blocks]/SwinLayer[1]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
22
+ 20,6,MHSA,nncf_module.swin.encoder.layers.1.blocks.1.attention.self.value,"(256, 256)","(96, 256)","(256,)","(96,)","[0, 1, 5]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[1]/ModuleList[blocks]/SwinLayer[1]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
23
+ 21,6,MHSA,nncf_module.swin.encoder.layers.1.blocks.1.attention.output.dense,"(256, 256)","(256, 96)","(256,)","(256,)","[0, 1, 5]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[1]/ModuleList[blocks]/SwinLayer[1]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
24
+ 22,7,FF,nncf_module.swin.encoder.layers.1.blocks.1.intermediate.dense,"(1024, 256)","(807, 256)","(1024,)","(807,)",[807 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[1]/ModuleList[blocks]/SwinLayer[1]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
25
+ 23,7,FF,nncf_module.swin.encoder.layers.1.blocks.1.output.dense,"(256, 1024)","(256, 807)","(256,)","(256,)",[807 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[1]/ModuleList[blocks]/SwinLayer[1]/SwinOutput[output]/NNCFLinear[dense]/linear_0
26
+ 24,8,MHSA,nncf_module.swin.encoder.layers.2.blocks.0.attention.self.query,"(512, 512)","(192, 512)","(512,)","(192,)","[3, 5, 8, 9, 10, 13]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[0]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
27
+ 25,8,MHSA,nncf_module.swin.encoder.layers.2.blocks.0.attention.self.key,"(512, 512)","(192, 512)","(512,)","(192,)","[3, 5, 8, 9, 10, 13]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[0]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
28
+ 26,8,MHSA,nncf_module.swin.encoder.layers.2.blocks.0.attention.self.value,"(512, 512)","(192, 512)","(512,)","(192,)","[3, 5, 8, 9, 10, 13]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[0]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
29
+ 27,8,MHSA,nncf_module.swin.encoder.layers.2.blocks.0.attention.output.dense,"(512, 512)","(512, 192)","(512,)","(512,)","[3, 5, 8, 9, 10, 13]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[0]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
30
+ 28,9,FF,nncf_module.swin.encoder.layers.2.blocks.0.intermediate.dense,"(2048, 512)","(1183, 512)","(2048,)","(1183,)",[1183 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[0]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
31
+ 29,9,FF,nncf_module.swin.encoder.layers.2.blocks.0.output.dense,"(512, 2048)","(512, 1183)","(512,)","(512,)",[1183 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[0]/SwinOutput[output]/NNCFLinear[dense]/linear_0
32
+ 30,10,MHSA,nncf_module.swin.encoder.layers.2.blocks.1.attention.self.query,"(512, 512)","(288, 512)","(512,)","(288,)","[1, 6, 7, 9, 10, 11, 12, 13, 14]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[1]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
33
+ 31,10,MHSA,nncf_module.swin.encoder.layers.2.blocks.1.attention.self.key,"(512, 512)","(288, 512)","(512,)","(288,)","[1, 6, 7, 9, 10, 11, 12, 13, 14]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[1]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
34
+ 32,10,MHSA,nncf_module.swin.encoder.layers.2.blocks.1.attention.self.value,"(512, 512)","(288, 512)","(512,)","(288,)","[1, 6, 7, 9, 10, 11, 12, 13, 14]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[1]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
35
+ 33,10,MHSA,nncf_module.swin.encoder.layers.2.blocks.1.attention.output.dense,"(512, 512)","(512, 288)","(512,)","(512,)","[1, 6, 7, 9, 10, 11, 12, 13, 14]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[1]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
36
+ 34,11,FF,nncf_module.swin.encoder.layers.2.blocks.1.intermediate.dense,"(2048, 512)","(1249, 512)","(2048,)","(1249,)",[1249 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[1]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
37
+ 35,11,FF,nncf_module.swin.encoder.layers.2.blocks.1.output.dense,"(512, 2048)","(512, 1249)","(512,)","(512,)",[1249 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[1]/SwinOutput[output]/NNCFLinear[dense]/linear_0
38
+ 36,12,MHSA,nncf_module.swin.encoder.layers.2.blocks.2.attention.self.query,"(512, 512)","(288, 512)","(512,)","(288,)","[1, 4, 5, 6, 7, 8, 9, 11, 12]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[2]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
39
+ 37,12,MHSA,nncf_module.swin.encoder.layers.2.blocks.2.attention.self.key,"(512, 512)","(288, 512)","(512,)","(288,)","[1, 4, 5, 6, 7, 8, 9, 11, 12]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[2]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
40
+ 38,12,MHSA,nncf_module.swin.encoder.layers.2.blocks.2.attention.self.value,"(512, 512)","(288, 512)","(512,)","(288,)","[1, 4, 5, 6, 7, 8, 9, 11, 12]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[2]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
41
+ 39,12,MHSA,nncf_module.swin.encoder.layers.2.blocks.2.attention.output.dense,"(512, 512)","(512, 288)","(512,)","(512,)","[1, 4, 5, 6, 7, 8, 9, 11, 12]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[2]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
42
+ 40,13,FF,nncf_module.swin.encoder.layers.2.blocks.2.intermediate.dense,"(2048, 512)","(1228, 512)","(2048,)","(1228,)",[1228 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[2]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
43
+ 41,13,FF,nncf_module.swin.encoder.layers.2.blocks.2.output.dense,"(512, 2048)","(512, 1228)","(512,)","(512,)",[1228 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[2]/SwinOutput[output]/NNCFLinear[dense]/linear_0
44
+ 42,14,MHSA,nncf_module.swin.encoder.layers.2.blocks.3.attention.self.query,"(512, 512)","(160, 512)","(512,)","(160,)","[3, 6, 7, 9, 11]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[3]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
45
+ 43,14,MHSA,nncf_module.swin.encoder.layers.2.blocks.3.attention.self.key,"(512, 512)","(160, 512)","(512,)","(160,)","[3, 6, 7, 9, 11]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[3]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
46
+ 44,14,MHSA,nncf_module.swin.encoder.layers.2.blocks.3.attention.self.value,"(512, 512)","(160, 512)","(512,)","(160,)","[3, 6, 7, 9, 11]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[3]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
47
+ 45,14,MHSA,nncf_module.swin.encoder.layers.2.blocks.3.attention.output.dense,"(512, 512)","(512, 160)","(512,)","(512,)","[3, 6, 7, 9, 11]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[3]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
48
+ 46,15,FF,nncf_module.swin.encoder.layers.2.blocks.3.intermediate.dense,"(2048, 512)","(1206, 512)","(2048,)","(1206,)",[1206 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[3]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
49
+ 47,15,FF,nncf_module.swin.encoder.layers.2.blocks.3.output.dense,"(512, 2048)","(512, 1206)","(512,)","(512,)",[1206 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[3]/SwinOutput[output]/NNCFLinear[dense]/linear_0
50
+ 48,16,MHSA,nncf_module.swin.encoder.layers.2.blocks.4.attention.self.query,"(512, 512)","(224, 512)","(512,)","(224,)","[1, 4, 5, 6, 7, 11, 13]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[4]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
51
+ 49,16,MHSA,nncf_module.swin.encoder.layers.2.blocks.4.attention.self.key,"(512, 512)","(224, 512)","(512,)","(224,)","[1, 4, 5, 6, 7, 11, 13]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[4]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
52
+ 50,16,MHSA,nncf_module.swin.encoder.layers.2.blocks.4.attention.self.value,"(512, 512)","(224, 512)","(512,)","(224,)","[1, 4, 5, 6, 7, 11, 13]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[4]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
53
+ 51,16,MHSA,nncf_module.swin.encoder.layers.2.blocks.4.attention.output.dense,"(512, 512)","(512, 224)","(512,)","(512,)","[1, 4, 5, 6, 7, 11, 13]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[4]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
54
+ 52,17,FF,nncf_module.swin.encoder.layers.2.blocks.4.intermediate.dense,"(2048, 512)","(1189, 512)","(2048,)","(1189,)",[1189 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[4]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
55
+ 53,17,FF,nncf_module.swin.encoder.layers.2.blocks.4.output.dense,"(512, 2048)","(512, 1189)","(512,)","(512,)",[1189 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[4]/SwinOutput[output]/NNCFLinear[dense]/linear_0
56
+ 54,18,MHSA,nncf_module.swin.encoder.layers.2.blocks.5.attention.self.query,"(512, 512)","(96, 512)","(512,)","(96,)","[1, 3, 5]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[5]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
57
+ 55,18,MHSA,nncf_module.swin.encoder.layers.2.blocks.5.attention.self.key,"(512, 512)","(96, 512)","(512,)","(96,)","[1, 3, 5]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[5]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
58
+ 56,18,MHSA,nncf_module.swin.encoder.layers.2.blocks.5.attention.self.value,"(512, 512)","(96, 512)","(512,)","(96,)","[1, 3, 5]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[5]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
59
+ 57,18,MHSA,nncf_module.swin.encoder.layers.2.blocks.5.attention.output.dense,"(512, 512)","(512, 96)","(512,)","(512,)","[1, 3, 5]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[5]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
60
+ 58,19,FF,nncf_module.swin.encoder.layers.2.blocks.5.intermediate.dense,"(2048, 512)","(1211, 512)","(2048,)","(1211,)",[1211 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[5]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
61
+ 59,19,FF,nncf_module.swin.encoder.layers.2.blocks.5.output.dense,"(512, 2048)","(512, 1211)","(512,)","(512,)",[1211 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[5]/SwinOutput[output]/NNCFLinear[dense]/linear_0
62
+ 60,20,MHSA,nncf_module.swin.encoder.layers.2.blocks.6.attention.self.query,"(512, 512)","(288, 512)","(512,)","(288,)","[0, 2, 3, 6, 7, 8, 12, 13, 14]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[6]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
63
+ 61,20,MHSA,nncf_module.swin.encoder.layers.2.blocks.6.attention.self.key,"(512, 512)","(288, 512)","(512,)","(288,)","[0, 2, 3, 6, 7, 8, 12, 13, 14]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[6]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
64
+ 62,20,MHSA,nncf_module.swin.encoder.layers.2.blocks.6.attention.self.value,"(512, 512)","(288, 512)","(512,)","(288,)","[0, 2, 3, 6, 7, 8, 12, 13, 14]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[6]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
65
+ 63,20,MHSA,nncf_module.swin.encoder.layers.2.blocks.6.attention.output.dense,"(512, 512)","(512, 288)","(512,)","(512,)","[0, 2, 3, 6, 7, 8, 12, 13, 14]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[6]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
66
+ 64,21,FF,nncf_module.swin.encoder.layers.2.blocks.6.intermediate.dense,"(2048, 512)","(1243, 512)","(2048,)","(1243,)",[1243 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[6]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
67
+ 65,21,FF,nncf_module.swin.encoder.layers.2.blocks.6.output.dense,"(512, 2048)","(512, 1243)","(512,)","(512,)",[1243 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[6]/SwinOutput[output]/NNCFLinear[dense]/linear_0
68
+ 66,22,MHSA,nncf_module.swin.encoder.layers.2.blocks.7.attention.self.query,"(512, 512)","(224, 512)","(512,)","(224,)","[0, 1, 4, 6, 10, 14, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[7]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
69
+ 67,22,MHSA,nncf_module.swin.encoder.layers.2.blocks.7.attention.self.key,"(512, 512)","(224, 512)","(512,)","(224,)","[0, 1, 4, 6, 10, 14, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[7]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
70
+ 68,22,MHSA,nncf_module.swin.encoder.layers.2.blocks.7.attention.self.value,"(512, 512)","(224, 512)","(512,)","(224,)","[0, 1, 4, 6, 10, 14, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[7]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
71
+ 69,22,MHSA,nncf_module.swin.encoder.layers.2.blocks.7.attention.output.dense,"(512, 512)","(512, 224)","(512,)","(512,)","[0, 1, 4, 6, 10, 14, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[7]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
72
+ 70,23,FF,nncf_module.swin.encoder.layers.2.blocks.7.intermediate.dense,"(2048, 512)","(1209, 512)","(2048,)","(1209,)",[1209 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[7]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
73
+ 71,23,FF,nncf_module.swin.encoder.layers.2.blocks.7.output.dense,"(512, 2048)","(512, 1209)","(512,)","(512,)",[1209 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[7]/SwinOutput[output]/NNCFLinear[dense]/linear_0
74
+ 72,24,MHSA,nncf_module.swin.encoder.layers.2.blocks.8.attention.self.query,"(512, 512)","(256, 512)","(512,)","(256,)","[0, 3, 4, 5, 9, 10, 13, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[8]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
75
+ 73,24,MHSA,nncf_module.swin.encoder.layers.2.blocks.8.attention.self.key,"(512, 512)","(256, 512)","(512,)","(256,)","[0, 3, 4, 5, 9, 10, 13, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[8]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
76
+ 74,24,MHSA,nncf_module.swin.encoder.layers.2.blocks.8.attention.self.value,"(512, 512)","(256, 512)","(512,)","(256,)","[0, 3, 4, 5, 9, 10, 13, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[8]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
77
+ 75,24,MHSA,nncf_module.swin.encoder.layers.2.blocks.8.attention.output.dense,"(512, 512)","(512, 256)","(512,)","(512,)","[0, 3, 4, 5, 9, 10, 13, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[8]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
78
+ 76,25,FF,nncf_module.swin.encoder.layers.2.blocks.8.intermediate.dense,"(2048, 512)","(1253, 512)","(2048,)","(1253,)",[1253 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[8]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
79
+ 77,25,FF,nncf_module.swin.encoder.layers.2.blocks.8.output.dense,"(512, 2048)","(512, 1253)","(512,)","(512,)",[1253 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[8]/SwinOutput[output]/NNCFLinear[dense]/linear_0
80
+ 78,26,MHSA,nncf_module.swin.encoder.layers.2.blocks.9.attention.self.query,"(512, 512)","(224, 512)","(512,)","(224,)","[0, 1, 2, 3, 4, 8, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[9]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
81
+ 79,26,MHSA,nncf_module.swin.encoder.layers.2.blocks.9.attention.self.key,"(512, 512)","(224, 512)","(512,)","(224,)","[0, 1, 2, 3, 4, 8, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[9]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
82
+ 80,26,MHSA,nncf_module.swin.encoder.layers.2.blocks.9.attention.self.value,"(512, 512)","(224, 512)","(512,)","(224,)","[0, 1, 2, 3, 4, 8, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[9]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
83
+ 81,26,MHSA,nncf_module.swin.encoder.layers.2.blocks.9.attention.output.dense,"(512, 512)","(512, 224)","(512,)","(512,)","[0, 1, 2, 3, 4, 8, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[9]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
84
+ 82,27,FF,nncf_module.swin.encoder.layers.2.blocks.9.intermediate.dense,"(2048, 512)","(1222, 512)","(2048,)","(1222,)",[1222 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[9]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
85
+ 83,27,FF,nncf_module.swin.encoder.layers.2.blocks.9.output.dense,"(512, 2048)","(512, 1222)","(512,)","(512,)",[1222 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[9]/SwinOutput[output]/NNCFLinear[dense]/linear_0
86
+ 84,28,MHSA,nncf_module.swin.encoder.layers.2.blocks.10.attention.self.query,"(512, 512)","(192, 512)","(512,)","(192,)","[0, 1, 5, 7, 9, 14]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[10]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
87
+ 85,28,MHSA,nncf_module.swin.encoder.layers.2.blocks.10.attention.self.key,"(512, 512)","(192, 512)","(512,)","(192,)","[0, 1, 5, 7, 9, 14]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[10]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
88
+ 86,28,MHSA,nncf_module.swin.encoder.layers.2.blocks.10.attention.self.value,"(512, 512)","(192, 512)","(512,)","(192,)","[0, 1, 5, 7, 9, 14]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[10]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
89
+ 87,28,MHSA,nncf_module.swin.encoder.layers.2.blocks.10.attention.output.dense,"(512, 512)","(512, 192)","(512,)","(512,)","[0, 1, 5, 7, 9, 14]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[10]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
90
+ 88,29,FF,nncf_module.swin.encoder.layers.2.blocks.10.intermediate.dense,"(2048, 512)","(1264, 512)","(2048,)","(1264,)",[1264 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[10]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
91
+ 89,29,FF,nncf_module.swin.encoder.layers.2.blocks.10.output.dense,"(512, 2048)","(512, 1264)","(512,)","(512,)",[1264 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[10]/SwinOutput[output]/NNCFLinear[dense]/linear_0
92
+ 90,30,MHSA,nncf_module.swin.encoder.layers.2.blocks.11.attention.self.query,"(512, 512)","(224, 512)","(512,)","(224,)","[1, 3, 6, 7, 11, 12, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[11]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
93
+ 91,30,MHSA,nncf_module.swin.encoder.layers.2.blocks.11.attention.self.key,"(512, 512)","(224, 512)","(512,)","(224,)","[1, 3, 6, 7, 11, 12, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[11]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
94
+ 92,30,MHSA,nncf_module.swin.encoder.layers.2.blocks.11.attention.self.value,"(512, 512)","(224, 512)","(512,)","(224,)","[1, 3, 6, 7, 11, 12, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[11]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
95
+ 93,30,MHSA,nncf_module.swin.encoder.layers.2.blocks.11.attention.output.dense,"(512, 512)","(512, 224)","(512,)","(512,)","[1, 3, 6, 7, 11, 12, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[11]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
96
+ 94,31,FF,nncf_module.swin.encoder.layers.2.blocks.11.intermediate.dense,"(2048, 512)","(1253, 512)","(2048,)","(1253,)",[1253 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[11]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
97
+ 95,31,FF,nncf_module.swin.encoder.layers.2.blocks.11.output.dense,"(512, 2048)","(512, 1253)","(512,)","(512,)",[1253 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[11]/SwinOutput[output]/NNCFLinear[dense]/linear_0
98
+ 96,32,MHSA,nncf_module.swin.encoder.layers.2.blocks.12.attention.self.query,"(512, 512)","(352, 512)","(512,)","(352,)","[1, 2, 3, 4, 5, 6, 7, 9, 10, 12, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[12]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
99
+ 97,32,MHSA,nncf_module.swin.encoder.layers.2.blocks.12.attention.self.key,"(512, 512)","(352, 512)","(512,)","(352,)","[1, 2, 3, 4, 5, 6, 7, 9, 10, 12, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[12]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
100
+ 98,32,MHSA,nncf_module.swin.encoder.layers.2.blocks.12.attention.self.value,"(512, 512)","(352, 512)","(512,)","(352,)","[1, 2, 3, 4, 5, 6, 7, 9, 10, 12, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[12]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
101
+ 99,32,MHSA,nncf_module.swin.encoder.layers.2.blocks.12.attention.output.dense,"(512, 512)","(512, 352)","(512,)","(512,)","[1, 2, 3, 4, 5, 6, 7, 9, 10, 12, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[12]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
102
+ 100,33,FF,nncf_module.swin.encoder.layers.2.blocks.12.intermediate.dense,"(2048, 512)","(1233, 512)","(2048,)","(1233,)",[1233 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[12]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
103
+ 101,33,FF,nncf_module.swin.encoder.layers.2.blocks.12.output.dense,"(512, 2048)","(512, 1233)","(512,)","(512,)",[1233 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[12]/SwinOutput[output]/NNCFLinear[dense]/linear_0
104
+ 102,34,MHSA,nncf_module.swin.encoder.layers.2.blocks.13.attention.self.query,"(512, 512)","(160, 512)","(512,)","(160,)","[2, 4, 5, 8, 10]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[13]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
105
+ 103,34,MHSA,nncf_module.swin.encoder.layers.2.blocks.13.attention.self.key,"(512, 512)","(160, 512)","(512,)","(160,)","[2, 4, 5, 8, 10]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[13]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
106
+ 104,34,MHSA,nncf_module.swin.encoder.layers.2.blocks.13.attention.self.value,"(512, 512)","(160, 512)","(512,)","(160,)","[2, 4, 5, 8, 10]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[13]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
107
+ 105,34,MHSA,nncf_module.swin.encoder.layers.2.blocks.13.attention.output.dense,"(512, 512)","(512, 160)","(512,)","(512,)","[2, 4, 5, 8, 10]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[13]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
108
+ 106,35,FF,nncf_module.swin.encoder.layers.2.blocks.13.intermediate.dense,"(2048, 512)","(1249, 512)","(2048,)","(1249,)",[1249 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[13]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
109
+ 107,35,FF,nncf_module.swin.encoder.layers.2.blocks.13.output.dense,"(512, 2048)","(512, 1249)","(512,)","(512,)",[1249 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[13]/SwinOutput[output]/NNCFLinear[dense]/linear_0
110
+ 108,36,MHSA,nncf_module.swin.encoder.layers.2.blocks.14.attention.self.query,"(512, 512)","(256, 512)","(512,)","(256,)","[0, 3, 4, 5, 10, 11, 14, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[14]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
111
+ 109,36,MHSA,nncf_module.swin.encoder.layers.2.blocks.14.attention.self.key,"(512, 512)","(256, 512)","(512,)","(256,)","[0, 3, 4, 5, 10, 11, 14, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[14]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
112
+ 110,36,MHSA,nncf_module.swin.encoder.layers.2.blocks.14.attention.self.value,"(512, 512)","(256, 512)","(512,)","(256,)","[0, 3, 4, 5, 10, 11, 14, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[14]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
113
+ 111,36,MHSA,nncf_module.swin.encoder.layers.2.blocks.14.attention.output.dense,"(512, 512)","(512, 256)","(512,)","(512,)","[0, 3, 4, 5, 10, 11, 14, 15]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[14]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
114
+ 112,37,FF,nncf_module.swin.encoder.layers.2.blocks.14.intermediate.dense,"(2048, 512)","(1066, 512)","(2048,)","(1066,)",[1066 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[14]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
115
+ 113,37,FF,nncf_module.swin.encoder.layers.2.blocks.14.output.dense,"(512, 2048)","(512, 1066)","(512,)","(512,)",[1066 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[14]/SwinOutput[output]/NNCFLinear[dense]/linear_0
116
+ 114,38,MHSA,nncf_module.swin.encoder.layers.2.blocks.15.attention.self.query,"(512, 512)","(224, 512)","(512,)","(224,)","[2, 4, 5, 6, 9, 10, 13]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[15]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
117
+ 115,38,MHSA,nncf_module.swin.encoder.layers.2.blocks.15.attention.self.key,"(512, 512)","(224, 512)","(512,)","(224,)","[2, 4, 5, 6, 9, 10, 13]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[15]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
118
+ 116,38,MHSA,nncf_module.swin.encoder.layers.2.blocks.15.attention.self.value,"(512, 512)","(224, 512)","(512,)","(224,)","[2, 4, 5, 6, 9, 10, 13]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[15]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
119
+ 117,38,MHSA,nncf_module.swin.encoder.layers.2.blocks.15.attention.output.dense,"(512, 512)","(512, 224)","(512,)","(512,)","[2, 4, 5, 6, 9, 10, 13]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[15]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
120
+ 118,39,FF,nncf_module.swin.encoder.layers.2.blocks.15.intermediate.dense,"(2048, 512)","(949, 512)","(2048,)","(949,)",[949 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[15]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
121
+ 119,39,FF,nncf_module.swin.encoder.layers.2.blocks.15.output.dense,"(512, 2048)","(512, 949)","(512,)","(512,)",[949 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[15]/SwinOutput[output]/NNCFLinear[dense]/linear_0
122
+ 120,40,MHSA,nncf_module.swin.encoder.layers.2.blocks.16.attention.self.query,"(512, 512)","(320, 512)","(512,)","(320,)","[0, 1, 2, 3, 6, 7, 8, 9, 10, 14]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[16]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
123
+ 121,40,MHSA,nncf_module.swin.encoder.layers.2.blocks.16.attention.self.key,"(512, 512)","(320, 512)","(512,)","(320,)","[0, 1, 2, 3, 6, 7, 8, 9, 10, 14]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[16]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
124
+ 122,40,MHSA,nncf_module.swin.encoder.layers.2.blocks.16.attention.self.value,"(512, 512)","(320, 512)","(512,)","(320,)","[0, 1, 2, 3, 6, 7, 8, 9, 10, 14]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[16]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
125
+ 123,40,MHSA,nncf_module.swin.encoder.layers.2.blocks.16.attention.output.dense,"(512, 512)","(512, 320)","(512,)","(512,)","[0, 1, 2, 3, 6, 7, 8, 9, 10, 14]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[16]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
126
+ 124,41,FF,nncf_module.swin.encoder.layers.2.blocks.16.intermediate.dense,"(2048, 512)","(848, 512)","(2048,)","(848,)",[848 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[16]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
127
+ 125,41,FF,nncf_module.swin.encoder.layers.2.blocks.16.output.dense,"(512, 2048)","(512, 848)","(512,)","(512,)",[848 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[16]/SwinOutput[output]/NNCFLinear[dense]/linear_0
128
+ 126,42,MHSA,nncf_module.swin.encoder.layers.2.blocks.17.attention.self.query,"(512, 512)","(128, 512)","(512,)","(128,)","[3, 4, 5, 9]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[17]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
129
+ 127,42,MHSA,nncf_module.swin.encoder.layers.2.blocks.17.attention.self.key,"(512, 512)","(128, 512)","(512,)","(128,)","[3, 4, 5, 9]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[17]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
130
+ 128,42,MHSA,nncf_module.swin.encoder.layers.2.blocks.17.attention.self.value,"(512, 512)","(128, 512)","(512,)","(128,)","[3, 4, 5, 9]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[17]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
131
+ 129,42,MHSA,nncf_module.swin.encoder.layers.2.blocks.17.attention.output.dense,"(512, 512)","(512, 128)","(512,)","(512,)","[3, 4, 5, 9]",SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[17]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
132
+ 130,43,FF,nncf_module.swin.encoder.layers.2.blocks.17.intermediate.dense,"(2048, 512)","(931, 512)","(2048,)","(931,)",[931 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[17]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
133
+ 131,43,FF,nncf_module.swin.encoder.layers.2.blocks.17.output.dense,"(512, 2048)","(512, 931)","(512,)","(512,)",[931 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[2]/ModuleList[blocks]/SwinLayer[17]/SwinOutput[output]/NNCFLinear[dense]/linear_0
134
+ 132,44,MHSA,nncf_module.swin.encoder.layers.3.blocks.0.attention.self.query,"(1024, 1024)","(1024, 1024)","(1024,)","(1024,)",[32 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[3]/ModuleList[blocks]/SwinLayer[0]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
135
+ 133,44,MHSA,nncf_module.swin.encoder.layers.3.blocks.0.attention.self.key,"(1024, 1024)","(1024, 1024)","(1024,)","(1024,)",[32 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[3]/ModuleList[blocks]/SwinLayer[0]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
136
+ 134,44,MHSA,nncf_module.swin.encoder.layers.3.blocks.0.attention.self.value,"(1024, 1024)","(1024, 1024)","(1024,)","(1024,)",[32 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[3]/ModuleList[blocks]/SwinLayer[0]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
137
+ 135,44,MHSA,nncf_module.swin.encoder.layers.3.blocks.0.attention.output.dense,"(1024, 1024)","(1024, 1024)","(1024,)","(1024,)",[32 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[3]/ModuleList[blocks]/SwinLayer[0]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
138
+ 136,45,FF,nncf_module.swin.encoder.layers.3.blocks.0.intermediate.dense,"(4096, 1024)","(1913, 1024)","(4096,)","(1913,)",[1913 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[3]/ModuleList[blocks]/SwinLayer[0]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
139
+ 137,45,FF,nncf_module.swin.encoder.layers.3.blocks.0.output.dense,"(1024, 4096)","(1024, 1913)","(1024,)","(1024,)",[1913 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[3]/ModuleList[blocks]/SwinLayer[0]/SwinOutput[output]/NNCFLinear[dense]/linear_0
140
+ 138,46,MHSA,nncf_module.swin.encoder.layers.3.blocks.1.attention.self.query,"(1024, 1024)","(1024, 1024)","(1024,)","(1024,)",[32 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[3]/ModuleList[blocks]/SwinLayer[1]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[query]/linear_0
141
+ 139,46,MHSA,nncf_module.swin.encoder.layers.3.blocks.1.attention.self.key,"(1024, 1024)","(1024, 1024)","(1024,)","(1024,)",[32 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[3]/ModuleList[blocks]/SwinLayer[1]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[key]/linear_0
142
+ 140,46,MHSA,nncf_module.swin.encoder.layers.3.blocks.1.attention.self.value,"(1024, 1024)","(1024, 1024)","(1024,)","(1024,)",[32 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[3]/ModuleList[blocks]/SwinLayer[1]/SwinAttention[attention]/SwinSelfAttention[self]/NNCFLinear[value]/linear_0
143
+ 141,46,MHSA,nncf_module.swin.encoder.layers.3.blocks.1.attention.output.dense,"(1024, 1024)","(1024, 1024)","(1024,)","(1024,)",[32 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[3]/ModuleList[blocks]/SwinLayer[1]/SwinAttention[attention]/SwinSelfOutput[output]/NNCFLinear[dense]/linear_0
144
+ 142,47,FF,nncf_module.swin.encoder.layers.3.blocks.1.intermediate.dense,"(4096, 1024)","(2059, 1024)","(4096,)","(2059,)",[2059 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[3]/ModuleList[blocks]/SwinLayer[1]/SwinIntermediate[intermediate]/NNCFLinear[dense]/linear_0
145
+ 143,47,FF,nncf_module.swin.encoder.layers.3.blocks.1.output.dense,"(1024, 4096)","(1024, 2059)","(1024,)","(1024,)",[2059 items],SwinForImageClassification/SwinModel[swin]/SwinEncoder[encoder]/ModuleList[layers]/SwinStage[3]/ModuleList[blocks]/SwinLayer[1]/SwinOutput[output]/NNCFLinear[dense]/linear_0
train_results.json ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ {
2
+ "epoch": 15.0,
3
+ "train_loss": 17.32710409305085,
4
+ "train_runtime": 64003.0137,
5
+ "train_samples_per_second": 17.753,
6
+ "train_steps_per_second": 0.277
7
+ }
trainer_state.json ADDED
The diff for this file is too large to render. See raw diff
 
training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a244741722e4f0b24fc9c7b571c76d6716803c7b9cf8eb866f7e8978f6d5a243
3
+ size 3771