Generate depth maps from images
Find objects in images using text prompts
Generate anime character speech from text