Update README.md
Browse files
README.md
CHANGED
@@ -8,6 +8,8 @@ base_model:
|
|
8 |
- ielabgroup/bert-base-uncased-fineweb100bt-smae
|
9 |
---
|
10 |
|
|
|
|
|
11 |
This model is a bert-baase-uncased size model initialized with [ielabgroup/bert-base-uncased-fineweb100bt-smae](https://huggingface.co/ielabgroup/bert-base-uncased-fineweb100bt-smae) fine-tuned on MS MARCO dataset with Starbucks Representation Learning (SRL) method. SRL enables elastic layer-dimension embedding generation for search.
|
12 |
|
13 |
The following layer-dimension pairs are involved during fine-tuning: [(2, 32), (4, 64), (6, 128), (8, 256), (10, 512), (12, 768)]
|
|
|
8 |
- ielabgroup/bert-base-uncased-fineweb100bt-smae
|
9 |
---
|
10 |
|
11 |
+
Model used in [Starbucks: Improved Training for 2D Matryoshka Embeddings](https://arxiv.org/pdf/2410.13230)
|
12 |
+
|
13 |
This model is a bert-baase-uncased size model initialized with [ielabgroup/bert-base-uncased-fineweb100bt-smae](https://huggingface.co/ielabgroup/bert-base-uncased-fineweb100bt-smae) fine-tuned on MS MARCO dataset with Starbucks Representation Learning (SRL) method. SRL enables elastic layer-dimension embedding generation for search.
|
14 |
|
15 |
The following layer-dimension pairs are involved during fine-tuning: [(2, 32), (4, 64), (6, 128), (8, 256), (10, 512), (12, 768)]
|