This is the test version for pruning. This model is a base model that will be pruned and quantized for on-device purpose.

I used mergekit for merging two models:

The two models I combined are:

Safetensors

Model size

10.7B params

Tensor type

FP16

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

Model tree for alnrg2arg/test

Quantizations