Edit Models filters

Inference status

Misc

AutoTrain Compatible

text-generation-inference

4-bit precision

Inference Endpoints

8-bit precision

Mixture of Experts

text-embeddings-inference

Misc with no match

Carbon Emissions

Models

4,833

Full-text search

Active filters: gptq

Qwen/Qwen-VL-Chat-Int4

Text Generation • Updated Jan 25 • 2.34k • 89

TheBloke/WizardLM-1.0-Uncensored-CodeLlama-34B-GPTQ

Text Generation • Updated Sep 27, 2023 • 30 • 7

TheBloke/storytime-13B-GPTQ

Text Generation • Updated Sep 27, 2023 • 242 • 31

TheBloke/Qwen-14B-Chat-GPTQ

Text Generation • Updated Oct 30, 2023 • 51 • 33

TheBloke/Mistral-7B-Instruct-v0.1-GPTQ

Text Generation • Updated Sep 29, 2023 • 128k • 78

le-vh/tinyllama-4bit-cpu

Text Generation • Updated Oct 8, 2023 • 64 • 3

TheBloke/OpenHermes-2-Mistral-7B-GPTQ

Text Generation • Updated Oct 16, 2023 • 648 • 25

TheBloke/llava-v1.5-13B-GPTQ

Text Generation • Updated Nov 6, 2023 • 127 • 36

TheBloke/rpguild-chatml-13B-GPTQ

Text Generation • Updated Oct 18, 2023 • 37 • 4

KoboldAI/LLaMA2-13B-Tiefighter-GPTQ

Text Generation • Updated Oct 19, 2023 • 111 • 13

TheBloke/Mistral-7B-Claude-Chat-GPTQ

Text Generation • Updated Oct 29, 2023 • 29 • 11

TheBloke/claude2-alpaca-7B-GPTQ

Text Generation • Updated Nov 10, 2023 • 30 • 3

Qwen/Qwen-72B-Chat-Int4

Text Generation • Updated Jan 4 • 484 • 46

TheBloke/Mixtral-8x7B-v0.1-GPTQ

Text Generation • Updated Dec 14, 2023 • 1k • 128

TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ

Text Generation • Updated Dec 14, 2023 • 156k • 135

TheBloke/Mistral-7B-Instruct-v0.2-GPTQ

Text Generation • Updated Dec 11, 2023 • 532k • 50

TheBloke/dolphin-2.5-mixtral-8x7b-GPTQ

Text Generation • Updated Dec 14, 2023 • 188 • 106

TheBloke/finance-LLM-GPTQ

Text Generation • Updated Dec 24, 2023 • 63 • 5

TheBloke/toxicqa-Llama2-13B-GPTQ

Updated Dec 31, 2023 • 6

TheBloke/dolphin-2.7-mixtral-8x7b-GPTQ

Text Generation • Updated Jan 1 • 1.8k • 19

TheBloke/Nous-Hermes-2-Mixtral-8x7B-SFT-GPTQ

Text Generation • Updated Jan 16 • 49 • 11

TheBloke/Etheria-55b-v0.1-GPTQ

Text Generation • Updated Jan 26 • 32 • 4

TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ

Updated Jan 31 • 2.48k • 55

Qwen/Qwen1.5-14B-Chat-GPTQ-Int4

Text Generation • Updated Apr 30 • 184 • 20

MaziyarPanahi/Meta-Llama-3-70B-Instruct-GPTQ

Text Generation • Updated Apr 19 • 445 • 19

nm-testing/Llama-2-7b-pruned2.4-Marlin_24

Text Generation • Updated May 15 • 922 • 1

neuralmagic/Mistral-7B-Instruct-v0.3-GPTQ-4bit

Text Generation • Updated Jun 10 • 1.24k • 16

cookey39/Five_Phases_Mindset

Text Generation • Updated May 27 • 20 • 1

Qwen/Qwen2-7B-Instruct-GPTQ-Int4

Text Generation • Updated Aug 21 • 16.7k • 23

neuralmagic/Meta-Llama-3-8B-Instruct-quantized.w8a16

Text Generation • Updated Jul 18 • 13.2k • 2