This is a test version for pruning. It is a base model that will be pruned and quantized for on-device use.

I used mergekit to merge two models.

The two models I combined are:

Model size: 10.7B params. Tensor type: FP16 (Safetensors).
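
Since the goal is on-device use, it can help to see how much weight storage is at stake. The sketch below is a back-of-the-envelope estimate based only on the 10.7B parameter count and FP16 tensor type listed for this model. The int8 and int4 rows and the `keep_ratio` pruning parameter are hypothetical targets chosen for illustration, not settings this card confirms, and the helper name `weight_size_gb` is made up for this example.

```python
# Rough weight-only memory estimate. It ignores activations, KV cache,
# quantization metadata (scales, zero points), and runtime overhead.
PARAMS = 10.7e9  # parameter count listed on this model card

# Bytes per parameter for each storage format. Only fp16 comes from the card;
# int8 and int4 are hypothetical quantization targets used for illustration.
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_size_gb(params: float, bytes_per_param: float, keep_ratio: float = 1.0) -> float:
    """Return weight storage in GB (10^9 bytes).

    keep_ratio is the fraction of parameters kept after pruning. It assumes
    structured pruning that physically removes weights; unstructured sparsity
    stored in a dense tensor would not shrink the file.
    """
    if not 0.0 < keep_ratio <= 1.0:
        raise ValueError("keep_ratio must be in (0, 1]")
    return params * keep_ratio * bytes_per_param / 1e9


if __name__ == "__main__":
    for name, nbytes in BYTES_PER_PARAM.items():
        print(f"{name}: {weight_size_gb(PARAMS, nbytes):.2f} GB")
```

Under these assumptions the unpruned FP16 weights come to roughly 21.4 GB, which is why both pruning and quantization matter for an on-device target.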