qwp4w3hyb's picture
Update README.md
b1c942e verified
|
raw
history blame
375 Bytes
---
license: gemma
language:
- en
pipeline_tag: text-generation
tags:
- google
- gemma
- gguf
- imatrix
---
# Quant Infos
- f32 gguf is from the official kaggle repo
- imatrix quants are running and will be uploaded one-by-one
- you will need the gemma2 llama.cpp [PR](https://github.com/ggerganov/llama.cpp/pull/8156) applied to your llama.cpp
# Original Model Card
TODO