MagnusSa
/

nb-bert-base-matryoshka

Sentence Similarity

sentence-transformers

feature-extraction

Generated from Trainer

dataset_size:132907

loss:MatryoshkaLoss

loss:MultipleNegativesRankingLoss

text-embeddings-inference

Inference Endpoints

Model card Files Files and versions Community

MagnusSa commited on 20 days ago

Commit

23965e6

•

1 Parent(s): 50fb745

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -447,6 +447,7 @@ model-index:
 # nb-bert-base-matryoshka
 This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [NbAiLab/nb-bert-base](https://huggingface.co/NbAiLab/nb-bert-base) on the utdanning_pair, [ltg/norquad](https://huggingface.co/datasets/ltg/norquad) and [NbAiLab/mnli-norwegian](https://huggingface.co/datasets/NbAiLab/mnli-norwegian) datasets. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
 ## Model Details

 # nb-bert-base-matryoshka
 This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [NbAiLab/nb-bert-base](https://huggingface.co/NbAiLab/nb-bert-base) on the utdanning_pair, [ltg/norquad](https://huggingface.co/datasets/ltg/norquad) and [NbAiLab/mnli-norwegian](https://huggingface.co/datasets/NbAiLab/mnli-norwegian) datasets. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
+As with the BGE architecture and Artic-embed I use the final hidden state of the [CLS] token as the embedding vector, instead of a mean pooling strategy.
 ## Model Details