Generate captions for images in various styles
Separate audio into stems using various models
Translate text into different languages