ryanzhangfan
add support for batch multimodal understanding
c059b33 verified