qwp4w3hyb
/

gemma-2-9b-it-iMat-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

gemma-2-9b-it-iMat-GGUF / README.md

qwp4w3hyb's picture

Update README.md

b1c942e verified 8 months ago

|

375 Bytes

	---
	license: gemma
	language:
	- en
	pipeline_tag: text-generation
	tags:
	- google
	- gemma
	- gguf
	- imatrix
	---

	# Quant Infos

	- f32 gguf is from the official kaggle repo
	- imatrix quants are running and will be uploaded one-by-one
	- you will need the gemma2 llama.cpp [PR](https://github.com/ggerganov/llama.cpp/pull/8156) applied to your llama.cpp

	# Original Model Card

	TODO