Update README.md
Browse files
README.md
CHANGED
@@ -49,7 +49,7 @@ ASSISTANT:
|
|
49 |
|
50 |
## Quantization process:
|
51 |
|
52 |
-
| Original Model | β | Float16 Model* | β | Safetensor Model** | β | EXL2 Model |
|
53 |
| -------------- | --- | ------------- | --- | ---------------- | --- | ---------- |
|
54 |
| [WizardLM 70B V1.0](https://huggingface.co/WizardLM/WizardLM-70B-V1.0) | β | [WizardLM 70B V1.0-HF](https://huggingface.co/simsim314/WizardLM-70B-V1.0-HF)* | β | Safetensor** | β | EXL2 |
|
55 |
|
@@ -65,7 +65,7 @@ python convert.py -i ~/float16_safetensored/WizardLM-70B-V1.0-HF -o ~/EXL2/Wizar
|
|
65 |
- https://github.com/oobabooga/text-generation-webui/blob/main/convert-to-safetensors.py
|
66 |
(best for sharding and float16/FP16 or bfloat16/BF16 conversion)
|
67 |
|
68 |
-
\*\* Use any one of the following scripts to convert your local
|
69 |
|
70 |
- https://github.com/turboderp/exllamav2/blob/master/util/convert_safetensors.py (official ExLlamaV2)
|
71 |
- https://huggingface.co/Panchovix/airoboros-l2-70b-gpt4-1.4.1-safetensors/blob/main/bin2safetensors/convert.py (recommended if model already converted to float16)
|
|
|
49 |
|
50 |
## Quantization process:
|
51 |
|
52 |
+
| Original Model | β | (optional but recommended) Float16 Model* | β | Safetensor Model** | β | EXL2 Model |
|
53 |
| -------------- | --- | ------------- | --- | ---------------- | --- | ---------- |
|
54 |
| [WizardLM 70B V1.0](https://huggingface.co/WizardLM/WizardLM-70B-V1.0) | β | [WizardLM 70B V1.0-HF](https://huggingface.co/simsim314/WizardLM-70B-V1.0-HF)* | β | Safetensor** | β | EXL2 |
|
55 |
|
|
|
65 |
- https://github.com/oobabooga/text-generation-webui/blob/main/convert-to-safetensors.py
|
66 |
(best for sharding and float16/FP16 or bfloat16/BF16 conversion)
|
67 |
|
68 |
+
\*\* Use any one of the following scripts to convert your local pytorch_model bin files to safetensors:
|
69 |
|
70 |
- https://github.com/turboderp/exllamav2/blob/master/util/convert_safetensors.py (official ExLlamaV2)
|
71 |
- https://huggingface.co/Panchovix/airoboros-l2-70b-gpt4-1.4.1-safetensors/blob/main/bin2safetensors/convert.py (recommended if model already converted to float16)
|