Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Xenova
/
vit-gpt2-image-captioning
like
22
Image-to-Text
Transformers.js
ONNX
vision-encoder-decoder
image-text-to-text
image-captioning
Model card
Files
Files and versions
Community
1
Use this model
7e763a4
vit-gpt2-image-captioning
/
onnx
2 contributors
History:
2 commits
Xenova
HF staff
Upload folder using huggingface_hub
7e763a4
over 1 year ago
decoder_model.onnx
Safe
768 MB
LFS
Upload folder using huggingface_hub
over 1 year ago
decoder_model_merged.onnx
Safe
768 MB
LFS
Upload folder using huggingface_hub
over 1 year ago
decoder_model_merged_quantized.onnx
Safe
196 MB
LFS
Upload folder using huggingface_hub
over 1 year ago
decoder_model_quantized.onnx
Safe
195 MB
LFS
Upload folder using huggingface_hub
over 1 year ago
decoder_with_past_model.onnx
Safe
768 MB
LFS
Upload folder using huggingface_hub
over 1 year ago
decoder_with_past_model_quantized.onnx
Safe
195 MB
LFS
Upload folder using huggingface_hub
over 1 year ago
encoder_model.onnx
Safe
343 MB
LFS
Upload folder using huggingface_hub
over 1 year ago
encoder_model_quantized.onnx
Safe
87.5 MB
LFS
Upload folder using huggingface_hub
over 1 year ago