Purpose of special tokens

#3
by tdeboissiere - opened

Hello!

Thanks for the detailed blog post, very helpful.
I was curious about the special tokens (e.g. ['<od>', '</od>', '<ocr>', '</ocr>']) in the Florence2Processor:

  • These tokens don't seem to be used anywhere, so what is their purpose?
  • Related: how was Florence-2 initially trained, say, for object detection? Were the inputs to the model the image, a text prompt such as "Locate the objects with category name in the image.", the category, and the actual locations of the objects in the image?
