Sequence Length Setting for Sentence Transformer

#3
by dylanAtHum - opened

Hey,

First off, thanks for your work here.

I was testing out these SGPT models using the SentenceTransformer package and noticed that sentence_bert_config.json sets max_seq_length=300. This causes the tokenizer to truncate at 300 tokens, even though the model itself is intended to support a 2k sequence length. On GitHub it's suggested to load the model with AutoModel and AutoTokenizer, unpack the hidden states, and call the model through torch directly. Testing this gave me the full 2k sequence length as best I can tell, but it might be worthwhile to update sentence_bert_config.json just for ease of use.
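For reference, the torch-direct approach mentioned above can be sketched roughly as follows: run the model for its last hidden states, then pool them into one embedding. SGPT uses position-weighted mean pooling; the function name and standalone pooling code here are illustrative, not the repo's exact code, and the model name in the usage comments is an assumption.

```python
import torch

def weighted_mean_pool(hidden: torch.Tensor, mask: torch.Tensor) -> torch.Tensor:
    """Position-weighted mean pooling (SGPT-style): token i gets weight i+1."""
    # weights: (1, seq_len, 1), growing linearly with token position
    weights = torch.arange(1, hidden.shape[1] + 1, dtype=hidden.dtype, device=hidden.device)
    weights = weights.unsqueeze(0).unsqueeze(-1)
    mask = mask.unsqueeze(-1).to(hidden.dtype)     # (batch, seq_len, 1)
    w = weights * mask                             # zero out padding positions
    return (hidden * w).sum(dim=1) / w.sum(dim=1)  # (batch, hidden_dim)

# Usage sketch (model name is an assumption for illustration):
# from transformers import AutoModel, AutoTokenizer
# tokenizer = AutoTokenizer.from_pretrained("Muennighoff/SGPT-125M-weightedmean-nli-bitfit")
# model = AutoModel.from_pretrained("Muennighoff/SGPT-125M-weightedmean-nli-bitfit")
# batch = tokenizer(["some text"], padding=True, truncation=True,
#                   max_length=2048, return_tensors="pt")
# with torch.no_grad():
#     hidden = model(**batch).last_hidden_state
# embeddings = weighted_mean_pool(hidden, batch["attention_mask"])
```

Alternatively, when staying within the SentenceTransformer wrapper, setting `model.max_seq_length = 2048` at runtime raises the truncation limit without editing the config file on disk.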

Thanks for noting! The reason it's set to 300 in sentence_bert_config.json is that during finetuning all sentences were cut off at 300 tokens, so I'm not sure how the model performs beyond 300 tokens. For most tasks, 300 tokens is enough to get a sufficiently comprehensive embedding.
