Edit Models filters

Inference status

Misc

8-bit precision

Misc with no match

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

19

Full-text search

Active filters: quark

fxmarty/llama-tiny-testing-quark-indev

Updated Oct 3, 2024 • 6

fxmarty/llama-tiny-int4-per-group-sym

Updated Oct 25, 2024 • 10

fxmarty/llama-tiny-w-fp8-a-fp8

Updated Oct 22, 2024 • 5

fxmarty/llama-tiny-w-fp8-a-fp8-o-fp8

Updated Oct 22, 2024 • 4

fxmarty/llama-tiny-w-int8-per-tensor

Updated Oct 22, 2024 • 6

fxmarty/llama-small-int4-per-group-sym-awq

Updated Oct 29, 2024 • 15

fxmarty/quark-legacy-int8

Updated Oct 10, 2024 • 19

fxmarty/llama-tiny-w-int8-b-int8-per-tensor

Updated Oct 22, 2024 • 13

fxmarty/llama-small-int4-per-group-sym-awq-old

Updated Oct 25, 2024 • 6

amd-quark/llama-tiny-w-int8-per-tensor

Updated Dec 18, 2024 • 239

amd-quark/llama-tiny-w-int8-b-int8-per-tensor

Updated Dec 18, 2024 • 239

amd-quark/llama-tiny-w-fp8-a-fp8

Updated Dec 18, 2024 • 237

amd-quark/llama-tiny-w-fp8-a-fp8-o-fp8

Updated Dec 18, 2024 • 240

amd-quark/llama-tiny-int4-per-group-sym

Updated Dec 18, 2024 • 237

amd-quark/llama-small-int4-per-group-sym-awq

Updated Dec 18, 2024 • 239

amd-quark/quark-legacy-int8

Updated Dec 18, 2024 • 208

amd/Llama-3.1-8B-Instruct-FP8-KV-Quark-test

Updated 19 days ago • 415

amd/Llama-3.1-8B-Instruct-w-int8-a-int8-sym-test

Updated 19 days ago • 87

EmbeddedLLM/Llama-3.1-8B-Instruct-w_fp8_per_channel_sym

Text Generation • Updated 4 days ago • 4