@QuantFactory Can you offer gguf quants for this model?
Hey @Cran-May Phi-3 128k is not yet supported by llama.cpp. Will quantise them immediately once it’s released
· Sign up or log in to comment