m2v-e5-large-european

A Model2Vec static embedding model distilled from intfloat/multilingual-e5-large (560M params), pruned to European languages only.

Pruning removed 36.5% of the vocabulary: tokens in non-European scripts such as CJK (Chinese, Japanese, Korean), Arabic, Hebrew, Thai, and Devanagari.
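The exact pruning procedure is not documented here. As a rough illustration only, the sketch below shows one way a script-based vocabulary filter could work, assuming tokens are dropped when any character's Unicode name starts with one of the listed script names. The function name `is_european_token` and the prefix list are hypothetical, not taken from this model's build.

```python
import unicodedata

# Hypothetical list of Unicode name prefixes for scripts to drop.
# This is an assumption for illustration, not the actual pruning rule.
NON_EUROPEAN_PREFIXES = (
    "CJK", "ARABIC", "HEBREW", "THAI",
    "DEVANAGARI", "HANGUL", "HIRAGANA", "KATAKANA",
)


def is_european_token(token: str) -> bool:
    """Return False if any character belongs to one of the excluded scripts."""
    for ch in token:
        # unicodedata.name returns e.g. "HEBREW LETTER SHIN"; "" if unnamed.
        name = unicodedata.name(ch, "")
        if name.startswith(NON_EUROPEAN_PREFIXES):
            return False
    return True


vocab = ["Duschgel", "▁shower", "中文", "שלום", "한국"]
kept = [tok for tok in vocab if is_european_token(tok)]
print(kept)
```

A name-prefix check is only one possible design. Filtering by Unicode block ranges or by language-specific corpora would give different results.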

               Before pruning    After pruning
Vocabulary     249,999 tokens    158,843 tokens
Embedding dim  256               256
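The figures above can be cross-checked with a few lines of arithmetic. This sketch assumes the model stores exactly one 256-dimensional vector per remaining token and no other weights. Under that assumption the parameter count comes out at about 40.7M.

```python
# Figures taken from the table above.
vocab_before = 249_999
vocab_after = 158_843
embedding_dim = 256

removed = vocab_before - vocab_after
pruned_fraction = removed / vocab_before

# Assumption: one embedding vector per token, no additional weights.
param_count = vocab_after * embedding_dim

print(f"removed tokens: {removed:,}")
print(f"pruned share: {pruned_fraction:.1%}")
print(f"embedding parameters: {param_count / 1e6:.1f}M")
```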

Usage

from model2vec import StaticModel
model = StaticModel.from_pretrained("flipbitsnotburgers/m2v-e5-large-european")
embeddings = model.encode(["deodorant", "Duschgel", "shower gel"])
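A common next step is to compare the returned vectors. The sketch below is one minimal way to do that with cosine similarity in plain Python. The helpers `cosine_similarity`, `rank_by_similarity`, and `demo` are illustrative names, not part of model2vec. `demo` is not called automatically because it needs `model2vec` installed and downloads the model.

```python
import math


def cosine_similarity(a, b) -> float:
    """Cosine similarity of two equal-length vectors (lists or array rows)."""
    dot = sum(float(x) * float(y) for x, y in zip(a, b))
    norm_a = math.sqrt(sum(float(x) ** 2 for x in a))
    norm_b = math.sqrt(sum(float(y) ** 2 for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_a * norm_b)


def rank_by_similarity(query_vec, texts, vectors):
    """Return (text, score) pairs sorted from most to least similar."""
    scored = [(t, cosine_similarity(query_vec, v)) for t, v in zip(texts, vectors)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


def demo():
    # Requires `pip install model2vec` and network access.
    from model2vec import StaticModel

    model = StaticModel.from_pretrained("flipbitsnotburgers/m2v-e5-large-european")
    texts = ["deodorant", "Duschgel", "shower gel"]
    vectors = model.encode(texts)
    # Rank the first two texts against "shower gel".
    for text, score in rank_by_similarity(vectors[2], texts[:2], vectors[:2]):
        print(f"{text}: {score:.3f}")
```

Calling `demo()` prints the two candidate texts ordered by their similarity to "shower gel". The actual scores depend on the model weights.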

License

MIT (same as base model)

Model size

40.7M params, stored as Safetensors in F16.