This is the Florence-VL 3B Pretrained Checkpoint. Train on detailed image caption from [PixelProse](https://huggingface.co./datasets/tomg-group-umd/pixelprose) and [ShareGPT4V](https://huggingface.co./datasets/Lin-Chen/ShareGPT4V). The repository also includes the Pretrained Vision Tower.