Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
llama-cpp
Inference Endpoints
Merge
Eval Results
text-generation-inference
AutoTrain Compatible
Mixture of Experts
Carbon Emissions
custom_code
4-bit precision
text-embeddings-inference
8-bit precision
Apply filters
Models
11,293
Full-text search
Edit filters
Sort: Trending
Active filters:
llama-cpp
Clear all
Triangle104/L3.1-Aglow-Vulca-v0.1-8B-Q4_K_M-GGUF
Updated
Oct 3, 2024
•
1
•
1
Triangle104/L3.1-Aglow-Vulca-v0.1-8B-Q5_K_S-GGUF
Updated
Oct 3, 2024
•
2
•
1
Triangle104/L3.1-Aglow-Vulca-v0.1-8B-Q5_K_M-GGUF
Updated
Oct 3, 2024
•
1
•
1
Triangle104/L3.1-Aglow-Vulca-v0.1-8B-Q6_K-GGUF
Updated
Oct 3, 2024
•
1
•
1
Triangle104/L3.1-Aglow-Vulca-v0.1-8B-Q8_0-GGUF
Updated
Oct 3, 2024
•
1
•
1
ZeroXClem/L3SAO-Mix-SuperHermes-NovaPurosani-8B-Q8_0-GGUF
Updated
Oct 5, 2024
•
1
ZeroXClem/L3SAO-Mix-SuperHermes-NovaPurosani-8B-Q4_0-GGUF
Updated
Oct 5, 2024
•
12
•
2
matrixportal/gemma-2-2b-Q4_K_M-GGUF
Text Generation
•
Updated
Oct 6, 2024
•
4
•
1
matrixportal/gemma-2-2b-Instruct-Finetune-turkishReviews-Q4_K_M-GGUF
Updated
Oct 6, 2024
•
1
matrixportal/gemma-2-2b-it-Q4_K_M-GGUF
Text Generation
•
Updated
Oct 6, 2024
•
1
•
1
matrixportal/gemma-2-9b-it-Q4_K_M-GGUF
Text Generation
•
Updated
Oct 6, 2024
•
1
•
1
matrixportal/Llama-3.2-1B-Instruct-Q4_K_M-GGUF
Text Generation
•
Updated
Oct 6, 2024
•
1
•
1
ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B-Q5_0-GGUF
Updated
Oct 7, 2024
•
2
•
2
ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B-Q4_K_M-GGUF
Updated
Oct 8, 2024
•
3
•
2
ZeroXClem/L3SAO-Mix-SuperHermes-NovaPurosani-8B-Q4_K_M-GGUF
Updated
Oct 10, 2024
•
3
•
1
ZeroXClem/L3SAO-Mix-SuperHermes-NovaPurosani-8B-Q5_0-GGUF
Updated
Oct 10, 2024
•
1
•
2
ZeroXClem/L3SAO-Mix-SuperHermes-NovaPurosani-8B-Q4_K_S-GGUF
Updated
Oct 10, 2024
•
1
aimlresearch2023/llama-3.3-1b-it-merged-Q6_K-GGUF
Updated
Oct 10, 2024
•
218
•
1
ggml-org/Qwen2.5-Coder-1.5B-Q8_0-GGUF
Text Generation
•
Updated
Oct 28, 2024
•
82
•
2
openerotica/writing-roleplay-20k-context-nemo-12b-v1.0-gguf
Updated
Oct 16, 2024
•
2.29k
•
18
ZeroXClem/Astral-Fusion-Neural-Happy-L3.1-8B-Q8_0-GGUF
Updated
Oct 14, 2024
•
1
•
3
ZeroXClem/Astral-Fusion-Neural-Happy-L3.1-8B-Q6_K-GGUF
Updated
Oct 14, 2024
•
3
•
3
ZeroXClem/Astral-Fusion-Neural-Happy-L3.1-8B-Q5_0-GGUF
Updated
Oct 14, 2024
•
1
•
3
ZeroXClem/Astral-Fusion-Neural-Happy-L3.1-8B-Q4_0-GGUF
Updated
Oct 14, 2024
•
12
•
3
ZeroXClem/Astral-Fusion-Neural-Happy-L3.1-8B-Q4_K_M-GGUF
Updated
Oct 14, 2024
•
3
ZeroXClem/Astral-Fusion-Neural-Happy-L3.1-8B-Q4_K_S-GGUF
Updated
Oct 14, 2024
•
1
•
2
ZeroXClem/Astral-Fusion-Neural-Happy-L3.1-8B-Q5_K_S-GGUF
Updated
Oct 14, 2024
•
2
•
1
ZeroXClem/Astral-Fusion-Neural-Happy-L3.1-8B-Q5_K_M-GGUF
Updated
Oct 14, 2024
•
3
samarth1029/Gemma-2-9b-baymax-Q4_K_M-GGUF
Updated
Nov 2, 2024
•
1
ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B-Q5_K_M-GGUF
Updated
Oct 17, 2024
•
2
Previous
1
2
3
4
5
...
100
Next