cnfusion
/

Rombos-LLM-V2.5-Qwen-32b-Q4-mlx

HF Leaderboard pegs this as one of the highest 32B parameter model, how is the quantized Q4 version ?

#1 opened 13 days ago by