File size: 595 Bytes
db14d75
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
---
pipeline_tag: text-generation
inference: true
widget:
- text: 'Hello!'
  example_title: Hello world
  group: Python
library_name: transformers
---

# yujiepan/opt-350m-w8a8-unstructured50

This model is w8a8 quantized & unstructually sparsified by OpenVINO, exported from [facebook/opt-350m](https://huggingface.co/facebook/opt-350m).

**This model is not tuned for accuracy.**

- Quantization: 8-bit symmetric for weights & activations
- Unstructured sparsity in transformer block linear layers: 50%

Codes for export: https://gist.github.com/yujiepan-work/1e6dd9f9c2aac0e9ecaf2ed4d82d1158