jinjieyuan
committed on
Commit
•
8c5d94f
1
Parent(s):
8f95f37
Update README.md
Browse files
README.md
CHANGED
@@ -1,6 +1,7 @@
|
|
1 |
---
|
2 |
language: en
|
3 |
license: apache-2.0
|
|
|
4 |
---
|
5 |
|
6 |
# SQFT Base Model: sqft-mistral-7b-v0.3-50-base-gptq
|
@@ -25,7 +26,7 @@ Refer to the commands in [SQFT/run_command/mistral-7b-v0.3/sparse_quantization.s
|
|
25 |
@article{munoz2024sqft,
|
26 |
title = {SQFT: Low-cost Model Adaptation in Low-precision Sparse Foundation Models},
|
27 |
author={J. Pablo Munoz and Jinjie Yuan and Nilesh Jain},
|
28 |
-
journal={},
|
29 |
year={2024}
|
30 |
}
|
31 |
```
|
@@ -36,4 +37,4 @@ Thanks to the sparse algorithm [Wanda](https://arxiv.org/abs/2306.11695) and t
|
|
36 |
|
37 |
## License
|
38 |
|
39 |
-
Apache-2.0
|
|
|
1 |
---
|
2 |
language: en
|
3 |
license: apache-2.0
|
4 |
+
library_name: transformers
|
5 |
---
|
6 |
|
7 |
# SQFT Base Model: sqft-mistral-7b-v0.3-50-base-gptq
|
|
|
26 |
@article{munoz2024sqft,
|
27 |
title = {SQFT: Low-cost Model Adaptation in Low-precision Sparse Foundation Models},
|
28 |
author={J. Pablo Munoz and Jinjie Yuan and Nilesh Jain},
|
29 |
+
journal={The 2024 Conference on Empirical Methods in Natural Language Processing (Findings)},
|
30 |
year={2024}
|
31 |
}
|
32 |
```
|
|
|
37 |
|
38 |
## License
|
39 |
|
40 |
+
Apache-2.0
|