Transform video frames using text instructions
Train a custom video model
Image to 3D with DPT + 3D Point Cloud