Meta-Llama-3.1-8B-Instruct-Q4_0-GGUF / meta-llama-3.1-8b-instruct-q4_0.gguf

Commit History

q4_0 : match AWQ format (F16 input / output tensors)
0aba27d
verified

ggerganov commited on

Upload meta-llama-3.1-8b-instruct-q4_0.gguf with huggingface_hub
83066b0
verified

ggerganov commited on