q4_0 : match AWQ format (F16 input / output tensors) 0aba27d verified ggerganov commited on Sep 2, 2024
Upload meta-llama-3.1-8b-instruct-q4_0.gguf with huggingface_hub 83066b0 verified ggerganov commited on Aug 14, 2024