How exactly do we get scalar quantization and product quantization?

#23

by masdeval - opened Nov 15

Nov 15

How do we get an embedding with float16 values?

Is the truncation the way to get the benefit of MRL? So, this means that mxbai-embed-large-v1 was trained to return 1024 dimension embedding but, because it uses MRL, we can safely get only the first 512 values?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment