VFMs
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding Paper • 2310.15308 • Published Oct 23, 2023 • 23
VLMs
Inject Semantic Concepts into Image Tagging for Open-Set Recognition Paper • 2310.15200 • Published Oct 23, 2023 • 6
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper • 2311.06242 • Published Nov 10, 2023 • 89