bgsach commited on
Commit
00a0f06
1 Parent(s): 45d80f4

Upload folder using huggingface_hub

Browse files
Files changed (4) hide show
  1. README.md +19 -0
  2. config.json +6 -0
  3. model.bin +3 -0
  4. vocabulary.json +0 -0
README.md ADDED
@@ -0,0 +1,19 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+ license: llama2
4
+ tags:
5
+ - code
6
+ ---
7
+
8
+ This is a *int8_float16* quantized version of **WizardLM/WizardCoder-Python-7B-V1.0**, quantized using [ctranslate2](https://github.com/OpenNMT/CTranslate2) (see inference instructions there).
9
+
10
+ **The license/caveats/intended usage is the same as the original model**.
11
+ The quality of its output may have
12
+ been negatively affected by the quantization process.
13
+
14
+
15
+ The command run to quantize the model was:
16
+
17
+ `ct2-transformers-converter --model ./models-hf/WizardLM/WizardCoder-Python-7B-V1.0 --quantization int8_float16 --output_dir ./models-ct/WizardLM/WizardCoder-Python-7B-V1.0-ct2-int8_float16`
18
+
19
+ The quantization was run on a 'high-mem', CPU only (8 core, 51GB) colab instance and took approximately 10 minutes.
config.json ADDED
@@ -0,0 +1,6 @@
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": "</s>",
3
+ "eos_token": "</s>",
4
+ "layer_norm_epsilon": 1e-05,
5
+ "unk_token": "</s>"
6
+ }
model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:14b0bfbd0586dfafd5ba579e2cf386bbb419c2308f30e8ba1be8e41047d6b9fd
3
+ size 6744414033
vocabulary.json ADDED
The diff for this file is too large to render. See raw diff