transformers==4.45.1 torch torchvision qwen-vl-utils spaces gradio tiktoken verovio