Any chance your team is working on a 4-bit Llama-3.2-90B-Vision-Instruct-quantized.w4a16 version?

#1
by mrhendrey - opened

Love the work that you do. Hoping you will put out some of the 4-bit quantized versions in the near future. Thank you!

Neural Magic org

Appreciate it! Yes, we are working on enabling quantization flows with calibration for VLMs.
