How exactly do we get scalar quantization and product quantization?

#23
by masdeval - opened

How do we get an embedding with float16 values?
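For reference, a minimal sketch of what float16 storage and int8 scalar quantization could look like with plain NumPy. The random array stands in for real model output, and `scalar_quantize_int8`, `lo`, and `hi` are hypothetical names for illustration, not part of any library's API. In practice the per-dimension ranges would likely be calibrated on a larger sample of embeddings than the batch being quantized.

```python
import numpy as np

# Stand-in for model output: 4 embeddings of 1024 float32 values.
# (Random data for illustration only, not real embeddings.)
rng = np.random.default_rng(0)
embeddings = rng.standard_normal((4, 1024)).astype(np.float32)

# float16: a plain dtype cast, which halves the storage per value.
emb_fp16 = embeddings.astype(np.float16)


def scalar_quantize_int8(x, lo, hi):
    """Linearly map each dimension from [lo, hi] to the int8 range [-128, 127]."""
    scale = (hi - lo) / 255.0
    scale = np.where(scale == 0, 1.0, scale)  # guard against constant dimensions
    q = np.round((x - lo) / scale) - 128
    return np.clip(q, -128, 127).astype(np.int8)


# Assumption: ranges are taken from this batch itself; a separate
# calibration set would be the more usual choice.
lo = embeddings.min(axis=0)
hi = embeddings.max(axis=0)
emb_int8 = scalar_quantize_int8(embeddings, lo, hi)
```

The float16 array uses half the bytes of the float32 original and the int8 array a quarter, at the cost of some precision in similarity scores.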

Is truncation how we get the benefit of MRL? In other words, mxbai-embed-large-v1 was trained to return a 1024-dimensional embedding, but because it uses MRL, can we safely keep only the first 512 values?
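To make the question concrete, here is a rough sketch of the truncation being described, assuming the model was indeed trained with MRL so that the leading dimensions carry most of the information. The helper `truncate_embeddings` is a hypothetical name, and the random array stands in for real embeddings. One detail worth noting: a truncated vector is no longer unit length, so it is re-normalized here before any cosine or dot-product comparison.

```python
import numpy as np


def truncate_embeddings(embeddings, dim):
    """Keep the first `dim` values of each embedding and re-normalize to unit length."""
    truncated = embeddings[:, :dim]
    norms = np.linalg.norm(truncated, axis=1, keepdims=True)
    return truncated / np.clip(norms, 1e-12, None)  # avoid division by zero


# Stand-in for 1024-dimensional model output (random data, illustration only).
rng = np.random.default_rng(0)
full = rng.standard_normal((3, 1024)).astype(np.float32)

small = truncate_embeddings(full, 512)
```

Whether 512 dimensions is "safe" depends on how much retrieval quality the application can give up, so it is worth measuring on your own data.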
