Xenova/vit-gpt2-image-captioning
Tags: Image-to-Text, Transformers.js, ONNX, vision-encoder-decoder, image-text-to-text, image-captioning
Branch: main
Path: vit-gpt2-image-captioning/onnx
2 contributors; history: 5 commits
Latest commit: Upload fp16 ONNX weights (918fd85, verified), by Xenova (HF staff), 9 months ago
File                                    Scan  Size     Storage  Last commit                          Updated
decoder_model.onnx                      Safe  613 MB   LFS      Upload folder using huggingface_hub  over 1 year ago
decoder_model_merged.onnx               Safe  615 MB   LFS      Upload folder using huggingface_hub  over 1 year ago
decoder_model_merged_fp16.onnx          Safe  310 MB   LFS      Upload fp16 ONNX weights             9 months ago
decoder_model_merged_quantized.onnx     Safe  159 MB   LFS      Upload folder using huggingface_hub  over 1 year ago
decoder_model_quantized.onnx            Safe  156 MB   LFS      Upload folder using huggingface_hub  over 1 year ago
decoder_with_past_model.onnx            Safe  613 MB   LFS      Upload folder using huggingface_hub  over 1 year ago
decoder_with_past_model_quantized.onnx  Safe  156 MB   LFS      Upload folder using huggingface_hub  over 1 year ago
encoder_model.onnx                      Safe  343 MB   LFS      Upload folder using huggingface_hub  over 1 year ago
encoder_model_fp16.onnx                 Safe  172 MB   LFS      Upload fp16 ONNX weights             9 months ago
encoder_model_quantized.onnx            Safe  87.5 MB  LFS      Upload folder using huggingface_hub  over 1 year ago