Is there any quantized version, with good performance, high accuracy that fits into 16GB

#2
by bdutta - opened
MLX Community org

Looking for a quantized, MLX optimized flavour of Qwen2.5-Coder-32B that has good performance, has high accuracy and fits into 16GB memory. Is there something available ?

Sign up or log in to comment