IntelligentEstate/Sakura_Warding-Qw2.5-7B-Q4_K_M-GGUF

This model was converted to GGUF format from newsbang/Homer-v0.5-Qwen2.5-7B using llama.cpp Refer to the original model card for more details on the model. Took a few Quantizations to get everything perfect.

Model Named for personal system use, after multiple Quants this turned out to be the most functional for me,

Downloads last month
17
GGUF
Model size
7.62B params
Architecture
qwen2

4-bit

Inference API
Unable to determine this model's library. Check the docs .

Model tree for IntelligentEstate/Sakura_Warding-H5-Qw2.5-7B-Q4_K_M-GGUF

Quantized
(5)
this model

Collections including IntelligentEstate/Sakura_Warding-H5-Qw2.5-7B-Q4_K_M-GGUF