Edit model card

T5-ANCE

T5-ANCE generally follows the training procedure described in this page, but uses a much larger batch size.

Dataset used for training:

  • MS MARCO Passage

Evaluation result:

Dataset Metric Result
MS MARCO Passage (dev) MRR@10 0.3570

Important hyper-parameters:

Name Value
Global batch size 256
Learning rate 5e-6
Maximum length of query 32
Maximum length of document 128
Template for query <text>
Template for document Title: <title> Text: <text>

Paper

-

Downloads last month
673
Safetensors
Model size
223M params
Tensor type
F32
·