Pre-trained MLP adapter from Video-LLaVA