File size: 355 Bytes
362b233
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
---
license: mit
---

Original model: https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct 

Quantitation documentation: https://docs.openvino.ai/nightly/notebooks/qwen2-vl-with-output.html

Quantitation config: 
```python
import nncf

compression_configuration = {
    "mode": nncf.CompressWeightsMode.INT4_ASYM,
    "group_size": 64,
    "ratio": 1.0,
}
```