Image captioning
For deployment as an Inference Endpoint using the Custom task type. This is a fixed version of this repo, updated to support file input in the "Test your endpoint" widget.
Request payload (each entry in "inputs" is a base64-encoded image)
{
"inputs": ["/9j/4AAQSkZJRgABAQEBLAEsAAD/2wBDAAMCAgICAgMC...."]
}
Response payload
{
"captions": ["inferred caption for image"]
}
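The request and response shapes above can be exercised from a client roughly as follows. This is a minimal sketch using only the Python standard library. The function names, the endpoint URL, and the token are hypothetical placeholders, and the Bearer authorization header is an assumption about how the endpoint is secured; none of them come from this repo.

```python
import base64
import json
import urllib.request


def build_payload(image_bytes_list):
    """Wrap raw image bytes in the {"inputs": [...]} request body."""
    return {
        "inputs": [base64.b64encode(b).decode("ascii") for b in image_bytes_list]
    }


def extract_captions(response_body):
    """Return the "captions" list from a JSON response string."""
    return json.loads(response_body)["captions"]


def caption_images(endpoint_url, token, image_bytes_list):
    """POST the images to the endpoint and return the captions.

    endpoint_url and token are placeholders supplied by the caller;
    Bearer authentication is assumed, not confirmed by this repo.
    """
    request = urllib.request.Request(
        endpoint_url,
        data=json.dumps(build_payload(image_bytes_list)).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return extract_captions(response.read().decode("utf-8"))
```

A call might look like `caption_images(url, token, [open("photo.jpg", "rb").read()])`, where `url` and `token` are the values for your own deployment. The response is expected to hold one caption per input image, in the same order as the inputs.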