LnL-AI
/

Yi-1.5-34B-Chat-4bit-gptq-autoround

Text Generation

text-generation-inference

Inference Endpoints

4-bit precision

Model card Files Files and versions Community

LiuRuiLin commited on May 15

Commit

278d441

•

1 Parent(s): 4a77fd3

Update README.md

Files changed (1) hide show

README.md +29 -3

README.md CHANGED Viewed

@@ -1,3 +1,29 @@
----
-license: unknown
----

+---
+license: unknown
+---
+### Quantize config:
+```json
+{
+  "bits": 4,
+  "group_size": 128,
+  "damp_percent": 0.01,
+  "desc_act": false,
+  "static_groups": false,
+  "sym": true,
+  "true_sequential": false,
+  "model_name_or_path": null,
+  "model_file_base_name": "model",
+  "quant_method": "gptq",
+  "checkpoint_format": "gptq",
+  "meta": {
+    "quantizer": "intel/auto-round:0.2.0.dev",
+    "packer": "autogptq:0.8.0.dev1",
+    "iters": 1000,
+    "lr": 0.001,
+    "minmax_lr": 0.001,
+    "enable_minmax_tuning": false,
+    "enable_quanted_input": true,
+    "scale_dtype": "float16"
+  }
+}
+```