t5-ance / README.md
yushi's picture
Update README.md
265a344
|
raw
history blame
No virus
618 Bytes
metadata
license: mit

T5-ANCE

T5-ANCE generally follows the training procedure described in this page, but uses a much larger batch size.

Dataset used for training:

  • MS MARCO Passage

Evaluation result:

Dataset Metric Result
MS MARCO Passage (dev) MRR@10 0.3570

Important hyper-parameters:

Name Value
Global batch size 256
Learning rate 5e-6
Maximum length of query 32
Maximum length of document 128
Template for query <text>
Template for document Title: <title> Text: <text>

Paper

-