Update README.md
Example to convert [WizardLM 70B V1.0](https://huggingface.co/WizardLM/WizardLM-):
```
python convert-to-safetensors.py ~/original/WizardLM-70B-V1.0 --output ~/float16_safetensored/WizardLM-70B-V1.0 --max-shard-size 10GB
```

Use `--bf16` if you'd like to try bfloat16 instead, but note that there are concerns about quantization quality – https://github.com/turboderp/exllamav2/issues/30#issuecomment-1719009289

Use any one of the following scripts to convert your local pytorch_model bin files to safetensors:

- https://github.com/turboderp/exllamav2/blob/master/util/convert_safetensors.py (official ExLlamaV2)