Update README.md
README.md CHANGED

@@ -5,7 +5,7 @@ datasets:
 
 ## Model Details
 
-This model is an int4 model with group_size 128 and symmetric quantization of [falcon-three-7b]() generated by [intel/auto-round](https://github.com/intel/auto-round). Load the model with revision
+This model is an int4 model with group_size 128 and symmetric quantization of [falcon-three-7b]() generated by [intel/auto-round](https://github.com/intel/auto-round). Load the model with revision `a10e358` to use AutoGPTQ format, with revision `e9aa317` to use AutoAWQ format
 
 ## How To Use
 ### INT4 Inference(CPU/HPU/CUDA)
@@ -18,7 +18,7 @@ tokenizer = AutoTokenizer.from_pretrained(quantized_model_dir)
 model = AutoModelForCausalLM.from_pretrained(
     quantized_model_dir,
     device_map="auto"
-    ## revision="" ##AutoGPTQ format
+    ## revision="a10e358" ##AutoGPTQ format
     ## revision="e9aa317" ##AutoAWQ format
 )
 text = "How many r in strawberry? The answer is "