Edit model card

Replete-LLM-V2.5-Qwen-14b

image/png

Replete-LLM-V2.5-Qwen-14b is a continues finetuned version of Qwen2.5-14B. I noticed recently that the Qwen team did not learn from my methods of continuous finetuning, the great benefits, and no downsides of it. So I took it upon myself to merge the instruct model with the base model myself using the Ties merge method

This version of the model shows higher performance than the original instruct and base models.

Quants: (Coming soon)

GGUF:

EXL2:

Benchmarks: (Coming soon)

Downloads last month
28
GGUF
Model size
14.8B params
Architecture
qwen2

5-bit

Inference API
Unable to determine this model’s pipeline type. Check the docs .

Model tree for Replete-AI/Replete-LLM-V2.5-Qwen-14b_Q5_k_m_GGUF

Base model

Qwen/Qwen2.5-14B
Quantized
(13)
this model