Edit model card

SQFT Base Model: sqft-phi-3-mini-4k-50-base-gptq

Model Sources

How to get this model

Refer to the commands in SQFT/run_command/phi-3-mini-4k-instruct/sparse_quantization.sh.

Citation

@article{munoz2024sqft,
  title = {SQFT: Low-cost Model Adaptation in Low-precision Sparse Foundation Models},
  author={J. Pablo Munoz and Jinjie Yuan and Nilesh Jain},
  journal={The 2024 Conference on Empirical Methods in Natural Language Processing (Findings)},
  year={2024}
}

Acknowledgement

Thanks to the sparse algorithm Wanda and the quantization method GPTQ.

License

Apache-2.0

Downloads last month
503
Safetensors
Model size
684M params
Tensor type
I32
·
FP16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for IntelLabs/sqft-phi-3-mini-4k-50-base-gptq

Adapters
2 models

Collection including IntelLabs/sqft-phi-3-mini-4k-50-base-gptq