metadata

license: cc-by-4.0
tags:
  - requests
  - gguf
  - quantized

Welcome to my GGUF-IQ-Imatrix Model Quantization Requests card!

Read bellow for more information.

Requirements to request model quantizations:

For the model:

Maximum model parameter size of 11B.
At the moment I am unable to accept requests for larger models due to hardware/time limitations.

Important:

Fill the request template as outlined in the next section.

How to request a model quantization:

Open a New Discussion with a title of "Request: Model-Author/Model-Name", for example, "Request: Nitral-AI/Infinitely-Laydiculous-7B".
Include the following template in your message and fill the information (example request here):

**Model name:**


**Model link:**


**Brief description:**


**Additonal quants (if you want any):**


Default list of quants for reference:

        "Q4_K_M", "Q4_K_S", "IQ4_XS", "Q5_K_M", "Q5_K_S",
        "Q6_K", "Q8_0", "IQ3_M", "IQ3_S", "IQ3_XXS"

**An image to represent the model (square shaped):**