Please post f16 quantization.

#1
by ZeroWw - opened

Please post f16 quantization.
Requantizing works better when starting from f16 or f32.
If you can, please post both.

I thought the original format was BF16.

Yes, but f16 (fp16) does not harm the model. bf16 is way bigger.

BF16 and F16 should be identical in size: both store 16 bits per weight.
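To illustrate the size point, here is a minimal sketch using only the Python standard library. The helper names (`to_f16_bytes`, `to_bf16_bytes`, `from_bf16_bytes`, `fits_in_f16`) are made up for this example and are not part of any GGUF tooling, and the bf16 conversion uses plain truncation, whereas real converters typically round. It shows that both formats take 2 bytes per value, and that they differ in how the 16 bits are split: f16 has a smaller exponent range, so a large value that bf16 can hold may not fit in f16.

```python
import struct

def to_f16_bytes(x: float) -> bytes:
    # IEEE 754 half precision: 1 sign, 5 exponent, 10 mantissa bits.
    return struct.pack("<e", x)

def to_bf16_bytes(x: float) -> bytes:
    # bfloat16 keeps the upper 16 bits of a float32:
    # 1 sign, 8 exponent, 7 mantissa bits.
    # Assumption: simple truncation, no rounding.
    (bits,) = struct.unpack("<I", struct.pack("<f", x))
    return struct.pack("<H", bits >> 16)

def from_bf16_bytes(b: bytes) -> float:
    # Pad the 16 stored bits back out to a float32 bit pattern.
    (hi,) = struct.unpack("<H", b)
    return struct.unpack("<f", struct.pack("<I", hi << 16))[0]

def fits_in_f16(x: float) -> bool:
    # struct refuses to pack finite values outside the f16 range.
    try:
        struct.pack("<e", x)
        return True
    except (OverflowError, struct.error):
        return False

if __name__ == "__main__":
    print("f16 bytes per value: ", len(to_f16_bytes(1.0)))
    print("bf16 bytes per value:", len(to_bf16_bytes(1.0)))
    print("1e5 fits in f16:     ", fits_in_f16(1e5))
    print("1e5 through bf16:    ", from_bf16_bytes(to_bf16_bytes(1e5)))
```

So the file sizes should match, but whether a BF16 to F16 conversion is lossless depends on the actual weight values: f16 keeps more mantissa bits, while bf16 covers a wider range.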

If you need the f32, I uploaded it here: https://huggingface.co/bartowski/Qwen2-7B-Instruct-GGUF/blob/main/Qwen2-7B-Instruct-f32.gguf
