Generate text based on prompts
Translate and generate text using a T5 model
Identify objects and poses in images