Engage in multi-modal conversations with images and videos
Calculate memory needed to train AI models