MMTEB: Massive Multilingual Text Embedding Benchmark • Paper • 2502.13595 • Published 23 days ago
Embedding Model Datasets • Collection • A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 68 items • Updated 17 days ago