SGPT-2.7B-weightedmean-msmarco-specb-bitfit
Usage
For usage instructions, refer to our codebase: https://github.com/Muennighoff/sgpt
Evaluation Results
For eval results, refer to the eval folder or our paper: https://arxiv.org/abs/2202.08904
Training
The model was trained with the parameters:
DataLoader:
torch.utils.data.dataloader.DataLoader
of length 124796 with parameters:
{'batch_size': 4, 'sampler': 'torch.utils.data.sampler.RandomSampler', 'batch_sampler': 'torch.utils.data.sampler.BatchSampler'}
Loss:
sentence_transformers.losses.MultipleNegativesRankingLoss.MultipleNegativesRankingLoss
with parameters:
{'scale': 20.0, 'similarity_fct': 'cos_sim'}
Parameters of the fit()-Method:
{
"epochs": 10,
"evaluation_steps": 0,
"evaluator": "NoneType",
"max_grad_norm": 1,
"optimizer_class": "<class 'transformers.optimization.AdamW'>",
"optimizer_params": {
"lr": 7.5e-05
},
"scheduler": "WarmupLinear",
"steps_per_epoch": null,
"warmup_steps": 1000,
"weight_decay": 0.01
}
Full Model Architecture
SentenceTransformer(
(0): Transformer({'max_seq_length': 300, 'do_lower_case': False}) with Transformer model: GPTNeoModel
(1): Pooling({'word_embedding_dimension': 2560, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': True, 'pooling_mode_lasttoken': False})
)
Citing & Authors
@article{muennighoff2022sgpt,
title={SGPT: GPT Sentence Embeddings for Semantic Search},
author={Muennighoff, Niklas},
journal={arXiv preprint arXiv:2202.08904},
year={2022}
}
- Downloads last month
- 269
Evaluation results
- accuracy on MTEB AmazonCounterfactualClassification (en)test set self-reported67.567
- ap on MTEB AmazonCounterfactualClassification (en)test set self-reported30.756
- f1 on MTEB AmazonCounterfactualClassification (en)test set self-reported61.805
- accuracy on MTEB AmazonPolarityClassificationtest set self-reported71.440
- ap on MTEB AmazonPolarityClassificationtest set self-reported65.913
- f1 on MTEB AmazonPolarityClassificationtest set self-reported70.906
- accuracy on MTEB AmazonReviewsClassification (en)test set self-reported35.748
- f1 on MTEB AmazonReviewsClassification (en)test set self-reported35.486
- map_at_1 on MTEB ArguAnatest set self-reported25.960
- map_at_10 on MTEB ArguAnatest set self-reported41.619
- map_at_100 on MTEB ArguAnatest set self-reported42.673
- map_at_1000 on MTEB ArguAnatest set self-reported42.684
- map_at_3 on MTEB ArguAnatest set self-reported36.569
- map_at_5 on MTEB ArguAnatest set self-reported39.397
- mrr_at_1 on MTEB ArguAnatest set self-reported26.316
- mrr_at_10 on MTEB ArguAnatest set self-reported41.772
- mrr_at_100 on MTEB ArguAnatest set self-reported42.820
- mrr_at_1000 on MTEB ArguAnatest set self-reported42.830
- mrr_at_3 on MTEB ArguAnatest set self-reported36.724
- mrr_at_5 on MTEB ArguAnatest set self-reported39.529
- ndcg_at_1 on MTEB ArguAnatest set self-reported25.960
- ndcg_at_10 on MTEB ArguAnatest set self-reported50.491
- ndcg_at_100 on MTEB ArguAnatest set self-reported54.865
- ndcg_at_1000 on MTEB ArguAnatest set self-reported55.107
- ndcg_at_3 on MTEB ArguAnatest set self-reported40.053
- ndcg_at_5 on MTEB ArguAnatest set self-reported45.134
- precision_at_1 on MTEB ArguAnatest set self-reported25.960
- precision_at_10 on MTEB ArguAnatest set self-reported7.895
- precision_at_100 on MTEB ArguAnatest set self-reported0.978
- precision_at_1000 on MTEB ArguAnatest set self-reported0.100
- precision_at_3 on MTEB ArguAnatest set self-reported16.714
- precision_at_5 on MTEB ArguAnatest set self-reported12.489
- recall_at_1 on MTEB ArguAnatest set self-reported25.960
- recall_at_10 on MTEB ArguAnatest set self-reported78.947
- recall_at_100 on MTEB ArguAnatest set self-reported97.795
- recall_at_1000 on MTEB ArguAnatest set self-reported99.644
- recall_at_3 on MTEB ArguAnatest set self-reported50.142
- recall_at_5 on MTEB ArguAnatest set self-reported62.447
- v_measure on MTEB ArxivClusteringP2Ptest set self-reported44.721
- v_measure on MTEB ArxivClusteringS2Stest set self-reported35.081
- map on MTEB AskUbuntuDupQuestionstest set self-reported59.635
- mrr on MTEB AskUbuntuDupQuestionstest set self-reported73.681
- cos_sim_pearson on MTEB BIOSSEStest set self-reported87.428
- cos_sim_spearman on MTEB BIOSSEStest set self-reported84.843
- euclidean_pearson on MTEB BIOSSEStest set self-reported85.593
- euclidean_spearman on MTEB BIOSSEStest set self-reported85.853
- manhattan_pearson on MTEB BIOSSEStest set self-reported85.412
- manhattan_spearman on MTEB BIOSSEStest set self-reported85.523
- accuracy on MTEB Banking77Classificationtest set self-reported83.218
- f1 on MTEB Banking77Classificationtest set self-reported83.154
- v_measure on MTEB BiorxivClusteringP2Ptest set self-reported34.414