Model Card for AVA Image Clip Embeddings

The AVA image dataset is a collection of digital photos with ratings. It was used to create the visual scorer that evaluated the images in Laion 5B to create the the Laion-Aesthetics dataset

https://github.com/imfing/ava_downloader/

“AVA: A Large-Scale Database for Aesthetic Visual Analysis”.

Naila Murray, Luca Marchesotti, Florent Perronnin, CVPR 2012.

New aesthetics scorer: https://github.com/kenjiqq/aesthetics-scorer/

Original aesthetics scorer: https://github.com/christophschuhmann/improved-aesthetic-predictor/

They were processed with OpenClip BigG-14, L-14, and H-14 models.

  • "laion/CLIP-ViT-bigG-14-laion2B-39B-b160k"

  • "laion/CLIP-ViT-H-14-laion2B-s32B-b79K"

  • "laion/CLIP-ViT-L-14-laion2B-s32B-b82K"

https://github.com/mlfoundations/open_clip

Not all images were processed!

Refer to the parquet for the succesfully processed images.

The parquet fields are

  • "image_name", #same id as AVA csv
  • "pooled_output"
  • "projected_embedding"
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.