Lewdiculous
/

Model-Requests

Model card Files Files and versions Community

Model-Requests / README.md

Lewdiculous's picture

Update README.md

8fc65ee verified 9 months ago

|

2.26 kB

	---
	license: cc-by-4.0
	tags:
	- requests
	- gguf
	- quantized
	---
	> [!WARNING]
	> Notice: <br>
	> Requests are paused at the moment due to unforseen circumstances.

	> [!TIP]
	> Support: <br>
	> My upload speeds have been cooked and unstable lately. <br>
	> Realistically I'd need to move to get a better provider. <br>
	> If you want and you are able to... <br>
	> [You can support my various endeavors here (Ko-fi).](https://ko-fi.com/Lewdiculous) <br>
	> I apologize for disrupting your experience.


	![requests-banner/png](https://huggingface.co./Lewdiculous/Model-Requests/resolve/main/requests-banner.png)

	# Welcome to my GGUF-IQ-Imatrix Model Quantization Requests card!

	Please read everything.

	This card is meant only to request GGUF-IQ-Imatrix quants for models that meet the requirements bellow.

	Requirements to request GGUF-Imatrix model quantizations:

	For the model:
	- Maximum model parameter size of 11B. <br>
	At the moment I am unable to accept requests for larger models due to hardware/time limitations.
	Preferably for Mistral based models in the creative/roleplay niche.

	Important:
	- Fill the request template as outlined in the next section.

	#### How to request a model quantization:

	1. Open a [New Discussion](https://huggingface.co./Lewdiculous/Model-Requests/discussions/new) titled "`Request: Model-Author/Model-Name`", for example, "`Request: Nitral-AI/Infinitely-Laydiculous-7B`", without the quotation marks.

	2. Include the following template in your post and fill the required information ([example request here](https://huggingface.co./Lewdiculous/Model-Requests/discussions/1)):

	```
	[Required] Model name:


	[Required] Model link:


	[Required] Brief description:


	[Required] An image/direct image link to represent the model (square shaped):


	[Optional] Additonal quants (if you want any):

	<!-- Keep in mind that anything bellow I/Q3 isn't recommended, -->
	<!-- since for these smaller models the results will likely be -->
	<!-- highly incoherent rendering them unusable for your needs. -->


	Default list of quants for reference:

	"IQ3_M", "IQ3_XXS",
	"Q4_K_M", "Q4_K_S", "IQ4_NL", "IQ4_XS",
	"Q5_K_M", "Q5_K_S",
	"Q6_K",
	"Q8_0"

	```