Squeezing out tensor bits, part III and final (for now 😉)
(For context please see: https://huggingface.co./posts/eaddario/832567461491467)
I have just finished uploading eaddario/Hammer2.1-7b-GGUF and eaddario/Dolphin3.0-Mistral-24B-GGUF.
While I was able to get a 7+% reduction with Hammer2.1-7b, the larger Dolphin3.0-Mistral-24B proved to be a tougher nut to crack (only 3%).
I have an idea as to why this was the case, which I'll test with QwQ-32B, but it will be a while before I can find the time.