Change an image using a one-sentence instruction
Transform video frames using text instructions
Transform images based on text instructions
Transcribe speech to text