Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
cnfusion
/
Rombos-LLM-V2.5-Qwen-32b-Q4-mlx
like
0
Text Generation
Transformers
Safetensors
MLX
qwen2
conversational
text-generation-inference
Inference Endpoints
4-bit precision
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (0)
HF Leaderboard pegs this as one of the highest 32B parameter model, how is the quantized Q4 version ?
#1 opened 13 days ago by
bdutta