
Yi-6B-200K - AWQ

This is a quantized (AWQ) version of Yi-6B-200K.

For more information about the model, see the original page.

It was quantized in the same way as Yi-34B-200K; more information about that process is available here.
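Below is a minimal usage sketch for loading the AWQ checkpoint with 🤗 Transformers (which can load AWQ checkpoints when the autoawq package is installed). The repository id and prompt are placeholders, not values taken from this page.

```python
# Minimal sketch: load the AWQ-quantized checkpoint with transformers.
# Requires the `autoawq` package to be installed alongside transformers.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "your-namespace/Yi-6B-200K-AWQ"  # placeholder: replace with this repository's id

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",        # place the quantized weights on the available GPU(s)
    trust_remote_code=True,
)

inputs = tokenizer("Write a short poem about the sea.", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```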

Safetensors model size: 1.27B params · Tensor types: I32, FP16
