Model Card for Model ID

This modelcard aims to be a base template for new models. It has been generated using this raw template.

Model Details

This is a Qwen 7B VL Model trained on a publically available financial documents datasets. The task was converted from a standard VQA into an OCR Model for extarcting out text from complex images.

  • Developed by: [Guneet Singh Kohli]
  • Model type: [Vision Language Model -> OCR]
  • Language(s) (NLP): [English]

Compute Infrastructure

Nvidia A100 80GB

[More Information Needed]

More Information [optional]

Reach out at [email protected]

Model Card Authors [optional]

Guneet Singh Kohli

Downloads last month
8
Safetensors
Model size
8.29B params
Tensor type
BF16
·
Inference Examples
Inference API (serverless) does not yet support transformers models for this pipeline type.

Model tree for guneetsk99/finance_qwen_VL_7B

Base model

Qwen/Qwen2-VL-7B
Finetuned
(115)
this model

Dataset used to train guneetsk99/finance_qwen_VL_7B

Space using guneetsk99/finance_qwen_VL_7B 1