Model card for vit_base_patch16_224_waifumerge
A Vision Transformer (ViT) image feature model. Trained with the self-supervised DINO method.
A Model Stock merge of vit_base_patch16_224.dino, vit_base_patch16_224.augreg2_in21k_ft_in1k, and wd-vit-tagger-v3, just to see what would happen ¯_(ツ)_/¯
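For readers unfamiliar with Model Stock, here is a minimal sketch of the per-layer interpolation as described in the Model Stock paper. It is not the script used to produce this checkpoint. The function name `model_stock_layer`, the flat-list weight layout, and the choice of which checkpoint acts as the base (anchor) are all assumptions for illustration; the card does not say which of the three models served as the anchor.

```python
import math


def model_stock_layer(base, finetuned):
    """Merge one flattened layer with the Model Stock rule (illustrative sketch).

    base      -- list of floats: the anchor weights (which checkpoint is the
                 anchor is an assumption; the card does not specify it)
    finetuned -- list of k >= 2 weight lists with the same length as `base`
    """
    k = len(finetuned)
    if k < 2:
        raise ValueError("Model Stock needs at least two fine-tuned models")

    # Offsets of each fine-tuned model from the anchor.
    deltas = [[w - b for w, b in zip(ft, base)] for ft in finetuned]

    # Average pairwise cosine between the offsets.
    cosines = []
    for i in range(k):
        for j in range(i + 1, k):
            dot = sum(a * b for a, b in zip(deltas[i], deltas[j]))
            norm_i = math.sqrt(sum(a * a for a in deltas[i]))
            norm_j = math.sqrt(sum(b * b for b in deltas[j]))
            norm = norm_i * norm_j
            cosines.append(dot / norm if norm > 0 else 0.0)
    cos_theta = sum(cosines) / len(cosines)

    # Interpolation ratio t = k*cos / (1 + (k-1)*cos).
    # Not guarded here: the denominator is zero when cos = -1/(k-1).
    t = k * cos_theta / (1 + (k - 1) * cos_theta)

    # Interpolate between the plain average and the anchor.
    avg = [sum(col) / k for col in zip(*finetuned)]
    return [t * a + (1 - t) * b for a, b in zip(avg, base)]


# Toy example: two "fine-tuned" layers whose offsets from the anchor are orthogonal.
merged = model_stock_layer([0.0, 0.0], [[1.0, 0.0], [0.0, 1.0]])
```

With orthogonal offsets the ratio t is 0, so the merged layer falls back to the anchor; the more the fine-tuned models agree in direction, the closer the result moves to their average.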