Quant for 3.0

README.md
---
license: apache-2.0
tags:
- moe
quantized_by: bartowski
pipeline_tag: text-generation
---

# Exllama v2 Quantizations of Beyonder-4x7B-v2 at 3.0 bits per weight

Using <a href="https://github.com/turboderp/exllamav2/releases/tag/v0.0.11">turboderp's ExLlamaV2 v0.0.11</a> for quantization.

## The "main" branch only contains the measurement.json; download one of the other branches for the model (see below)

Each branch contains the model quantized to an individual bits-per-weight value; the main branch contains only the measurement.json used for further conversions.

Conversion was done using the default calibration dataset.

Default arguments were used, except when the bits per weight is above 6.0; at that point the lm_head layer is quantized at 8 bits per weight instead of the default 6.

Original model: https://huggingface.co/mlabonne/Beyonder-4x7B-v2

<a href="https://huggingface.co/bartowski/Beyonder-4x7B-v2-exl2/tree/3_5">3.5 bits per weight</a>

<a href="https://huggingface.co/bartowski/Beyonder-4x7B-v2-exl2/tree/3_75">3.75 bits per weight</a>

<a href="https://huggingface.co/bartowski/Beyonder-4x7B-v2-exl2/tree/4_5">4.5 bits per weight</a>

<a href="https://huggingface.co/bartowski/Beyonder-4x7B-v2-exl2/tree/6_5">6.5 bits per weight</a>
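
As a rough guide to choosing a branch, the download size and VRAM footprint scale approximately linearly with bits per weight. A minimal sketch of that arithmetic; the ~24e9 total parameter count for a 4x7B Mixtral-style MoE is an assumption for illustration, not a figure from this card:

```python
def approx_size_gb(n_params: float, bpw: float) -> float:
    """Rough quantized-model size in gigabytes: parameters * bits per weight.

    Ignores format overhead (embeddings, quantization scales, metadata).
    """
    return n_params * bpw / 8 / 1e9

# Assumed figure: a 4x7B Mixtral-style MoE has roughly 24e9 total parameters.
N_PARAMS = 24e9

for branch in ["3_5", "3_75", "4_5", "6_5"]:
    bpw = float(branch.replace("_", "."))  # branch names encode bits per weight
    print(f"{branch}: ~{approx_size_gb(N_PARAMS, bpw):.1f} GB")
```

For example, under that parameter-count assumption the 4.5 bpw branch works out to roughly 13–14 GB, which is why the lower-bpw branches exist for smaller GPUs.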

## Download instructions

With git:

```shell
git clone --single-branch --branch 3_0 https://huggingface.co/bartowski/Beyonder-4x7B-v2-exl2
```

With huggingface hub (credit to TheBloke for instructions):

```shell
pip3 install huggingface-hub
```

To download the `main` branch (only useful if you just want the measurement.json) to a folder called `Beyonder-4x7B-v2-exl2`:

```shell
mkdir Beyonder-4x7B-v2-exl2
huggingface-cli download bartowski/Beyonder-4x7B-v2-exl2 --local-dir Beyonder-4x7B-v2-exl2 --local-dir-use-symlinks False
```

To download from a different branch, add the `--revision` parameter:

```shell
mkdir Beyonder-4x7B-v2-exl2
huggingface-cli download bartowski/Beyonder-4x7B-v2-exl2 --revision 3_0 --local-dir Beyonder-4x7B-v2-exl2 --local-dir-use-symlinks False
```
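
The per-branch command varies only in the `--revision` value, so it can be generated programmatically. A small helper as a sketch; the `3_5`/`4_5`-style branch-name convention is taken from the links above, and the default repo id is this model's:

```python
def download_command(bpw: str, repo: str = "bartowski/Beyonder-4x7B-v2-exl2") -> str:
    """Build the huggingface-cli invocation for one quantization branch."""
    local_dir = repo.split("/")[-1]  # e.g. Beyonder-4x7B-v2-exl2
    return (
        f"huggingface-cli download {repo} --revision {bpw} "
        f"--local-dir {local_dir} --local-dir-use-symlinks False"
    )

print(download_command("6_5"))
```

This only formats the command string; run the output in a shell (after `pip3 install huggingface-hub`) to perform the actual download.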
|