Edit model card

SQFT Base Model: sqft-phi-3-mini-4k-instruct-base-gptq

Model Sources

Citation

@article{munoz2024sqft,
  title = {SQFT: Low-cost Model Adaptation in Low-precision Sparse Foundation Models},
  author={J. Pablo Munoz and Jinjie Yuan and Nilesh Jain},
  journal={The 2024 Conference on Empirical Methods in Natural Language Processing (Findings)},
  year={2024}
}

Acknowledgement

Thanks to the quantization method GPTQ.

License

Apache-2.0

Downloads last month
4
Safetensors
Model size
684M params
Tensor type
I32
·
FP16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Collection including IntelLabs/sqft-phi-3-mini-4k-instruct-base-gptq