IntelligentEstate
/

Sakura_Warding-H5-Qw2.5-7B-Q4_K_M-GGUF

Inference Endpoints

Model card Files Files and versions Community

IntelligentEstate/Sakura_Warding-Qw2.5-7B-Q4_K_M-GGUF

This model was converted to GGUF format from `newsbang/Homer-v0.5-Qwen2.5-7B` using llama.cpp Refer to the original model card for more details on the model. Took a few Quantizations to get everything perfect.

Model Named for personal system use, after multiple Quants this turned out to be the most functional for me,

Downloads last month: 17

GGUF

Model size

7.62B params

Architecture

qwen2

4-bit

Inference API

Unable to determine this model's library. Check the docs .

Model tree for IntelligentEstate/Sakura_Warding-H5-Qw2.5-7B-Q4_K_M-GGUF

Base model

newsbang/Homer-v0.5-Qwen2.5-7B

Quantized

(5)

this model

Collections including IntelligentEstate/Sakura_Warding-H5-Qw2.5-7B-Q4_K_M-GGUF

SotA-GGUF

32 items • Updated about 13 hours ago • 3

SotA

42 items • Updated 4 days ago • 1