speakleash
/

Bielik-11B-v2.2-Instruct-MLX-8bit

Text Generation

text-generation-inference

Model card Files Files and versions Community

Bielik-11B-v2.2-Instruct-MLX-8bit

This model was converted to MLX format from SpeakLeash's Bielik-11B-v.2.2-Instruct.

DISCLAIMER: Be aware that quantised models show reduced response quality and possible hallucinations!

Use with mlx

pip install mlx-lm

from mlx_lm import load, generate

model, tokenizer = load("speakleash/Bielik-11B-v2.2-Instruct-MLX-8bit")
response = generate(model, tokenizer, prompt="hello", verbose=True)

Model description:

Developed by: SpeakLeash & ACK Cyfronet AGH
Language: Polish
Model type: causal decoder-only
Quant from: Bielik-11B-v2.2-Instruct
Finetuned from: Bielik-11B-v2
License: Apache 2.0 and Terms of Use

Responsible for model quantization

Remigiusz Kinas^SpeakLeash - team leadership, conceptualizing, calibration data preparation, process creation and quantized model delivery.

Contact Us

If you have any questions or suggestions, please use the discussion tab. If you want to contact us directly, join our Discord SpeakLeash.

Downloads last month: 18

Safetensors

Model size

3.24B params

Tensor type

FP16

·

U32

·

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The model authors have turned it off explicitly.

Model tree for speakleash/Bielik-11B-v2.2-Instruct-MLX-8bit

Base model

speakleash/Bielik-11B-v2

Finetuned

speakleash/Bielik-11B-v2.2-Instruct

Finetuned

(9)

this model

Collection including speakleash/Bielik-11B-v2.2-Instruct-MLX-8bit

Bielik-11B-v2.2

A collection of models based on Bielik-11B-v2.2 - instruct and quantized versions. • 17 items • Updated Oct 26, 2024 • 28