-
-
-
-
-
-
Inference status
Active filters:
gptq
Qwen/Qwen-VL-Chat-Int4
Text Generation
•
Updated
•
2.34k
•
89
TheBloke/WizardLM-1.0-Uncensored-CodeLlama-34B-GPTQ
Text Generation
•
Updated
•
30
•
7
TheBloke/storytime-13B-GPTQ
Text Generation
•
Updated
•
242
•
31
TheBloke/Qwen-14B-Chat-GPTQ
Text Generation
•
Updated
•
51
•
33
TheBloke/Mistral-7B-Instruct-v0.1-GPTQ
Text Generation
•
Updated
•
128k
•
78
le-vh/tinyllama-4bit-cpu
Text Generation
•
Updated
•
64
•
3
TheBloke/OpenHermes-2-Mistral-7B-GPTQ
Text Generation
•
Updated
•
648
•
25
TheBloke/llava-v1.5-13B-GPTQ
Text Generation
•
Updated
•
127
•
36
TheBloke/rpguild-chatml-13B-GPTQ
Text Generation
•
Updated
•
37
•
4
KoboldAI/LLaMA2-13B-Tiefighter-GPTQ
Text Generation
•
Updated
•
111
•
13
TheBloke/Mistral-7B-Claude-Chat-GPTQ
Text Generation
•
Updated
•
29
•
11
TheBloke/claude2-alpaca-7B-GPTQ
Text Generation
•
Updated
•
30
•
3
Qwen/Qwen-72B-Chat-Int4
Text Generation
•
Updated
•
484
•
46
TheBloke/Mixtral-8x7B-v0.1-GPTQ
Text Generation
•
Updated
•
1k
•
128
TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ
Text Generation
•
Updated
•
156k
•
135
TheBloke/Mistral-7B-Instruct-v0.2-GPTQ
Text Generation
•
Updated
•
532k
•
50
TheBloke/dolphin-2.5-mixtral-8x7b-GPTQ
Text Generation
•
Updated
•
188
•
106
TheBloke/finance-LLM-GPTQ
Text Generation
•
Updated
•
63
•
5
TheBloke/toxicqa-Llama2-13B-GPTQ
TheBloke/dolphin-2.7-mixtral-8x7b-GPTQ
Text Generation
•
Updated
•
1.8k
•
19
TheBloke/Nous-Hermes-2-Mixtral-8x7B-SFT-GPTQ
Text Generation
•
Updated
•
49
•
11
TheBloke/Etheria-55b-v0.1-GPTQ
Text Generation
•
Updated
•
32
•
4
TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ
Updated
•
2.48k
•
55
Qwen/Qwen1.5-14B-Chat-GPTQ-Int4
Text Generation
•
Updated
•
184
•
20
MaziyarPanahi/Meta-Llama-3-70B-Instruct-GPTQ
Text Generation
•
Updated
•
445
•
19
nm-testing/Llama-2-7b-pruned2.4-Marlin_24
Text Generation
•
Updated
•
922
•
1
neuralmagic/Mistral-7B-Instruct-v0.3-GPTQ-4bit
Text Generation
•
Updated
•
1.24k
•
16
cookey39/Five_Phases_Mindset
Text Generation
•
Updated
•
20
•
1
Qwen/Qwen2-7B-Instruct-GPTQ-Int4
Text Generation
•
Updated
•
16.7k
•
23
neuralmagic/Meta-Llama-3-8B-Instruct-quantized.w8a16
Text Generation
•
Updated
•
13.2k
•
2