mean pooling code in readme

#15
by claralp - opened

Isn't there a mistake in the example code for the mean pooling strategy? It currently reads:

```python
outputs = torch.sum(outputs * inputs["attention_mask"][:, :, None], dim=1) / torch.sum(inputs["attention_mask"])
```

but shouldn't it be:

```python
outputs = torch.sum(outputs * inputs["attention_mask"][:, :, None], dim=1) / torch.sum(inputs["attention_mask"], dim=1, keepdim=True)
```

Otherwise each sequence is divided by the total token count of the whole batch, not by its own token count, so the average depends on all the embedded texts and not just the current one.
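To make the difference concrete, here is a small self-contained sketch with made-up embeddings and an attention mask (the tensor values are purely illustrative, not from the README). With the buggy denominator, every sequence is divided by the batch-wide token count; with `dim=1, keepdim=True`, each sequence is averaged over its own unpadded tokens:

```python
import torch

# Toy batch: 2 sequences, 2 tokens each, hidden size 3.
# The second sequence has one padding token (mask = 0).
token_embeddings = torch.tensor(
    [[[1.0, 1.0, 1.0], [3.0, 3.0, 3.0]],
     [[2.0, 2.0, 2.0], [9.0, 9.0, 9.0]]]  # second token is padding
)
attention_mask = torch.tensor([[1, 1], [1, 0]])

# Zero out padding tokens, then sum over the sequence dimension.
summed = torch.sum(token_embeddings * attention_mask[:, :, None], dim=1)

# Buggy: divides every sequence by the total token count of the batch (3).
buggy = summed / torch.sum(attention_mask)

# Fixed: divides each sequence by its own token count (2 and 1).
fixed = summed / torch.sum(attention_mask, dim=1, keepdim=True)

print(buggy)  # first row is 4/3, second row is 2/3 -- both wrong
print(fixed)  # both rows are the true per-sequence means: [2., 2., 2.]
```

The `keepdim=True` keeps the per-sequence counts as a column of shape `(batch, 1)`, so they broadcast correctly against the `(batch, hidden)` sums.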

Mixedbread org

Thank you @claralp! Yes, you are right; I overlooked this, sorry about that. Will fix!

aamirshakir changed discussion status to closed
