Maximum Sequence Length

by tomaarsen - opened 1 day ago

The model card mentions both a Context Length of 32,768 tokens, as well as a "Document length" of 512 tokens. Sentence Transformers uses the "max_seq_length": 512, from https://huggingface.co/LiquidAI/LFM2.5-Embedding-350M/blob/main/sentence_bert_config.json#L2 and adopts the 512. Is this indeed the expected behaviour? Just making sure 🤗

Tom Aarsen

EdoardoMosca

Liquid AI org 1 day ago

Hey @tomaarsen thanks for flagging this!

512 is indeed the right number. The 32k context length refers to the original LFM2.5 backbone. But I do agree it can be confusing, I'll remove it across cards

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment