Update README.md
README.md
CHANGED
@@ -2664,6 +2664,12 @@ embeddings = model.encode(['How is the weather today?', 'What is the current wea
 print(cos_sim(embeddings[0], embeddings[1]))
 ```
 
+If you only want to handle shorter sequences, such as 2k, pass the `max_length` parameter to the `encode` function:
+
+```python
+embeddings = model.encode(['Very long ... document'], max_length=2048)
+```
+
 For long sequences, it's recommended to perform inference using Flash Attention. Using Flash Attention allows you to increase the batch size and throughput for long sequence lengths.
 We include an experimental implementation for Flash Attention, shipped with the model.
 Install the following triton version:
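The hunk above prints `cos_sim(embeddings[0], embeddings[1])` without showing where `cos_sim` comes from. As a reference, here is a minimal NumPy sketch of cosine similarity; this is an illustration of what such a helper computes, not the implementation shipped with the model:

```python
import numpy as np

def cos_sim(a, b):
    # Cosine similarity: the dot product of the two vectors
    # divided by the product of their L2 norms.
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Identical vectors score ~1.0; orthogonal vectors score 0.0.
print(cos_sim([1.0, 0.0], [1.0, 1.0]))  # ≈ 0.707
```

In practice the embeddings returned by `model.encode` are just such vectors, so any cosine-similarity helper (e.g. one from your embedding library of choice) will give the same comparison.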