Architecture: ResNet-18 with two modifications.
- First layer replaced by a 1-channel Conv2D (grayscale input).
- FC layer replaced by a 2-way output.
Training procedure:
- Pre-trained on ImageNet.
- Further trained on Fashion-MNIST.
- Final training on the task of predicting whether Fashion-MNIST images are vertically flipped.
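
The labels for the final task can be built from the images alone. The dependency-free sketch below illustrates one way to do it, under several assumptions not stated in the source: "flipped vertically" is read as upside-down (row order reversed), each image is flipped with probability 0.5, and label 1 means flipped. The names `vertical_flip` and `make_flip_dataset` are hypothetical.

```python
import random


def vertical_flip(image):
    """Return the image upside-down by reversing the order of its rows."""
    return [list(row) for row in reversed(image)]


def make_flip_dataset(images, flip_prob=0.5, seed=0):
    """Pair each image with a binary label: 1 if it was flipped, 0 if not.

    flip_prob and seed are assumptions; the original clothing-class
    labels are not used for this task.
    """
    rng = random.Random(seed)
    samples = []
    for image in images:
        if rng.random() < flip_prob:
            samples.append((vertical_flip(image), 1))
        else:
            samples.append(([list(row) for row in image], 0))
    return samples


# Toy 2x2 "images" standing in for 28x28 Fashion-MNIST arrays.
toy_images = [[[0, 0], [1, 1]], [[2, 3], [4, 5]]]
dataset = make_flip_dataset(toy_images)
```

With real data the same idea would typically be applied to tensors, for example by reversing the height axis of each image batch.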