Why does this model only have 1.27B params?

by boy977 - opened Mar 4

Mar 4

As displayed by Hugging Face, this MLX 4-bit version has only 1.27B params, a mix of FP16 and U32. Can the team provide more technical information about how this model was made?

qnguyen3

Vietnamese Mistral org Mar 4

@boy977 Hi, this model is the same as the original Vistral without any modification. The only changes are that this version is built for the Apple MLX backend (you need an Apple Silicon Mac, M1/M2/M3, to run it) and that it uses 4-bit quantization. I think the number being shown is a bug on Hugging Face's side that they have not been able to fix. Anyway, the model has 7B params in total.
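
A rough sketch of where a number in this range could come from, assuming the page counts stored tensor elements rather than logical weights. With MLX-style 4-bit quantization, eight 4-bit weights are packed into each U32 element, and every group of weights also stores an FP16 scale and bias. The group size of 64, the round 7B figure, and the assumption that every weight is quantized are illustrative guesses, not values confirmed in this thread; `displayed_element_count` is a hypothetical helper.

```python
def displayed_element_count(n_params: int, bits: int = 4, group_size: int = 64) -> int:
    """Estimate stored tensor elements for a group-quantized model.

    Assumptions (not confirmed by the thread): every weight is quantized,
    weights are packed into 32-bit words, and each group of `group_size`
    weights stores one scale and one bias.
    """
    weights_per_u32 = 32 // bits             # 8 weights per U32 at 4 bits
    packed = n_params // weights_per_u32     # U32 elements holding the weights
    scales = n_params // group_size          # one FP16 scale per group
    biases = n_params // group_size          # one FP16 bias per group
    return packed + scales + biases


true_params = 7_000_000_000  # round figure for a "7B" model
shown = displayed_element_count(true_params)
print(f"{shown / 1e9:.2f}B stored elements")  # prints "1.09B stored elements"
```

Under these assumptions the stored element count is roughly 16% of the true parameter count, which is the same order as the 1.27B shown. The remaining gap would plausibly come from tensors kept unquantized in FP16 and from the real parameter count not being exactly 7B, but that part is speculation.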

qnguyen3 changed discussion status to closed Mar 4
