IQ2_XXS vs IQ2_M

#8
by markgb1 - opened

It seems like IQ2_XXS and IQ2_M are the same size, is there functionally any difference between them (e.g. in memory usage)? I usually would look at the unsloth website for perplexity vs memory usage benchmarks, but it doesn't seem like this model has been benchmarked yet.

Sign up or log in to comment