YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co./docs/hub/model-cards#model-card-metadata)
Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Model details:
Chain-of-Spot encourages Large Vision-Language Models to identify the region of interest (ROI) in the image condition on the question and reasoning through an interactive manner, thereby improving the ability of visual understanding.
Where to send questions or comments about the model: https://github.com/dongyh20/Chain-of-Spot
Paper or resources for more information: https://sites.google.com/view/chain-of-spot/
- Downloads last month
- 6,243
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
the model is not deployed on the HF Inference API.