Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
fp8
Inference Endpoints
AutoTrain Compatible
text-generation-inference
custom_code
8-bit precision
Merge
Eval Results
Mixture of Experts
Misc with no match
4-bit precision
text-embeddings-inference
Carbon Emissions
Apply filters
Models
317
Full-text search
Edit filters
Sort: Trending
Active filters:
fp8
Clear all
neuralmagic/Mixtral-8x7B-Instruct-v0.1-FP8
Text Generation
•
Updated
Jul 18
•
1.1k
•
3
nm-testing/opt-125m-fp8-dynamic
Text Generation
•
Updated
Apr 27
•
18
anyisalin/Meta-Llama-3-8B-Instruct-FP8
Text Generation
•
Updated
May 6
•
10
anyisalin/Meta-Llama-3-8B-Instruct-FP8-D
Text Generation
•
Updated
Apr 28
•
9
anyisalin/lzlv_70b_fp16_hf-FP8-D
Text Generation
•
Updated
Apr 28
•
11
anyisalin/Meta-Llama-3-70B-Instruct-FP8-D
Text Generation
•
Updated
Apr 28
•
11
anyisalin/Mixtral-8x7B-Instruct-v0.1-FP8-D
Text Generation
•
Updated
Apr 28
•
12
nm-testing/llama-3-instruct-fp8-static-shared-scales
Text Generation
•
Updated
Apr 28
•
10
nm-testing/llama-3-instruct-fp8-dynamic-shared-scales
Text Generation
•
Updated
Apr 28
•
10
pcmoritz/Mixtral-8x7B-v0.1-fp8-act-scale
Text Generation
•
Updated
May 2
•
16
anyisalin/Meta-Llama-3-70B-Instruct-FP8
Text Generation
•
Updated
May 8
•
10
neuralmagic/Meta-Llama-3-8B-Instruct-FP8-KV
Text Generation
•
Updated
Jun 19
•
8.33k
•
6
comaniac/Meta-Llama-3-8B-Instruct-FP8-v1
Text Generation
•
Updated
May 24
•
6
comaniac/Mixtral-8x22B-Instruct-v0.1-FP8-v1
Text Generation
•
Updated
May 28
•
11
comaniac/Meta-Llama-3-70B-Instruct-FP8-v1
Text Generation
•
Updated
May 26
•
9
comaniac/Mixtral-8x7B-Instruct-v0.1-FP8-v1
Text Generation
•
Updated
May 26
•
12
comaniac/Mixtral-8x7B-Instruct-v0.1-FP8-v2
Text Generation
•
Updated
Jun 10
•
11
Skywork/Skywork-MoE-Base-FP8
Text Generation
•
Updated
Jul 31
•
14
•
6
comaniac/Meta-Llama-3-70B-Instruct-FP8-v2
Text Generation
•
Updated
Jun 10
•
22
comaniac/Mixtral-8x7B-Instruct-v0.1-FP8-v3
Text Generation
•
Updated
Jun 10
•
18
comaniac/Mixtral-8x22B-Instruct-v0.1-FP8-v2
Text Generation
•
Updated
Jun 10
•
21
nm-testing/granite-20b-code-base-FP8
Text Generation
•
Updated
Jun 12
•
14
nm-testing/granite-3b-code-base-FP8
Text Generation
•
Updated
Jun 12
•
12
fr00000/dolp-fp8
Text Generation
•
Updated
Jun 13
•
12
neuralmagic/Qwen2-0.5B-Instruct-FP8
Text Generation
•
Updated
Jul 18
•
1.45k
•
2
nm-testing/opt-125m-fp8-static-kv
Text Generation
•
Updated
Jun 14
•
17
neuralmagic/Qwen2-1.5B-Instruct-FP8
Text Generation
•
Updated
Jul 18
•
29
neuralmagic/Qwen2-7B-Instruct-FP8
Text Generation
•
Updated
Jul 18
•
736
•
1
anyisalin/L3-70B-Euryale-v2.1-FP8
Text Generation
•
Updated
Jun 18
•
375
nm-testing/Qwen2-0.5B-Instruct-FP8-KV
Text Generation
•
Updated
Jun 18
•
15
Previous
1
2
3
4
...
11
Next