Most Recent Merge
Collection
7 items
•
Updated
•
1
See the main model card: https://huggingface.co./brucethemoose/Yi-34B-200K-RPMerge
Experimental imatrix quantizations, at 16K context, using a cocktail of data taken from the exllamav2 repo, kalomaze's random token tests and some stories.
Consider them experimental! But they may be better than the equivalent non-matrix GGUFs. Or they may not!
2-bit