Model Description

This repo contains ONNX exports for the multilingual CLIP model M-CLIP/XLM-Roberta-Large-Vit-B-16Plus. It separates the visual and textual encoders into separate models for the purpose of generating image and text embeddings.

This repo is specifically intended for use with Immich, a self-hosted photo library.

Downloads last month
5,278
Inference API
Unable to determine this model’s pipeline type. Check the docs .

Collection including immich-app/XLM-Roberta-Large-Vit-B-16Plus