Update README.md
## Quantization process:

| Original Model | → | Float16 Model* | → | Safetensor Model** | → | EXL2 Model |
| -------------- | --- | -------------- | --- | ------------------ | --- | ---------- |
| [WizardLM 70B V1.0](https://huggingface.co/WizardLM/WizardLM-70B-V1.0) | → | [WizardLM 70B V1.0-HF](https://huggingface.co/simsim314/WizardLM-70B-V1.0-HF)* | → | Safetensor** | → | EXL2 |
Example to convert WizardLM-70B-V1.0-HF_float16_safetensored to EXL2 4.0 bpw with 6-bit head:

```
mkdir -p ~/EXL2/WizardLM-70B-V1.0-HF_4bit # Create the output directory
python convert.py -i ~/float16_safetensored/WizardLM-70B-V1.0-HF -o ~/EXL2/WizardLM-70B-V1.0-HF_4bit -c ~/EXL2/0000.parquet -b 4.0 -hb 6
```
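As a rough back-of-the-envelope check (an illustration, not part of the original instructions): `-b 4.0` sets the average bits per weight, so the quantized weight payload is approximately parameter count × bpw / 8 bytes, plus some overhead for the format and the higher-precision 6-bit head:

```python
def exl2_size_gb(n_params: float, bpw: float) -> float:
    """Approximate quantized weight payload: params * bits-per-weight / 8, in GB."""
    return n_params * bpw / 8 / 1e9

# A 70B-parameter model at the -b 4.0 setting above works out to ~35 GB
# of quantized weights (format overhead and the 6-bit head add a little).
size_gb = exl2_size_gb(70e9, 4.0)
```

This is useful for picking a bpw that fits your GPU's VRAM before starting a long conversion run.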
\* Use the following script to convert your local pytorch_model bin files to float16 (you can also choose bfloat16) + safetensors all in one go:

- https://github.com/oobabooga/text-generation-webui/blob/main/convert-to-safetensors.py (best for sharding and float16/FP16 or bfloat16/BF16 conversion)
\*\* Use any one of the following scripts to convert your local pytorch_model bin files to safetensors:

- https://github.com/turboderp/exllamav2/blob/master/util/convert_safetensors.py (official ExLlamaV2)
- https://huggingface.co/Panchovix/airoboros-l2-70b-gpt4-1.4.1-safetensors/blob/main/bin2safetensors/convert.py (recommended if model already converted to float16)
- https://gist.github.com/epicfilemcnulty/1f55fd96b08f8d4d6693293e37b4c55e#file-2safetensors-py
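For the curious, the file the scripts above emit is a simple format: an 8-byte little-endian header size, a JSON header describing each tensor's dtype, shape, and byte offsets, then the raw tensor data. A minimal stdlib-only sketch of that layout (for illustration only; it is not one of the linked scripts, and real conversions should go through the official `safetensors` library):

```python
import json
import os
import struct
import tempfile

def write_safetensors(path, tensors):
    """Minimal .safetensors-style writer, for illustration only.

    tensors: {name: (dtype_str, shape_list, raw_bytes)}
    Layout: u64 LE header size, JSON header, then raw tensor data.
    """
    header, blobs, offset = {}, [], 0
    for name, (dtype, shape, data) in tensors.items():
        header[name] = {
            "dtype": dtype,
            "shape": shape,
            "data_offsets": [offset, offset + len(data)],
        }
        offset += len(data)
        blobs.append(data)
    hjson = json.dumps(header).encode("utf-8")
    with open(path, "wb") as f:
        f.write(struct.pack("<Q", len(hjson)))  # header size as u64 little-endian
        f.write(hjson)
        for blob in blobs:
            f.write(blob)

def read_header(path):
    """Parse just the JSON header back out of the file."""
    with open(path, "rb") as f:
        (size,) = struct.unpack("<Q", f.read(8))
        return json.loads(f.read(size))

# Demo: four float16 values stored as one tensor named "weight".
data = struct.pack("<4e", 1.0, 2.0, 3.0, 4.0)  # 'e' = half precision
path = os.path.join(tempfile.mkdtemp(), "tiny.safetensors")
write_safetensors(path, {"weight": ("F16", [4], data)})
hdr = read_header(path)
```

Because the header is plain JSON and the data is raw bytes, safetensors files load without executing pickled code, which is the main reason the quantization pipeline prefers them over pytorch_model bin files.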