uday610 commited on
Commit
85db336
1 Parent(s): 7734a35

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +39 -0
README.md ADDED
@@ -0,0 +1,39 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model:
4
+ - mistralai/Mistral-7B-Instruct-v0.3
5
+ ---
6
+
7
+
8
+ # mistralai/Mistral-7B-Instruct-v0.3
9
+ - ## Introduction
10
+ - Quantization Tool: Quark 0.6.0
11
+ - OGA Model Builder: v0.5.1
12
+ - ## Quantization Strategy
13
+ - AWQ / Group 128 / Asymmetric / UINT4 Weights / FP16 activations
14
+ - Excluded Layers: None
15
+ ```
16
+ python3 quantize_quark.py \
17
+ --model_dir "$model" \
18
+ --output_dir "$output_dir" \
19
+ --quant_scheme w_uint4_per_group_asym \
20
+ --num_calib_data 128 \
21
+ --quant_algo awq \
22
+ --dataset pileval_for_awq_benchmark \
23
+ --seq_len 512 \
24
+ --model_export quark_safetensors \
25
+ --data_type float16 \
26
+ --exclude_layers [] \
27
+ --custom_mode awq
28
+ ```
29
+ - ## OGA Model Builder
30
+ ```
31
+ python builder.py \
32
+ -i <quantized safetensor model dir> \
33
+ -o <oga model output dir> \
34
+ -p int4 \
35
+ -e dml
36
+ ```
37
+ - PostProcessed to generate Hybrid Model
38
+ -
39
+