I'm a little late... I guess.

Link to original model and script:

Downloads last month
12
GGUF
Model size
12.2B params
Architecture
llama

3-bit

4-bit

Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.