Input embeddings, output similarity: can this be done through an inference endpoint?

#38
by click37 - opened

Hi there

I am computing sentence embeddings with this model and storing them so that I don't have to run that computation in real time. When a new sentence arrives in real time, I compute its embedding and want to measure the similarity between that new embedding and all the ones I computed earlier.

Can this be done here?

You can use any open-source vector database (Chroma, Weaviate) to retrieve the best matches among semantically similar sentences. I hope that's what you're looking for.
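If the set of stored embeddings is small, the comparison can also be done by hand, without a database. Here is a minimal sketch of that idea, assuming the embeddings are plain lists of floats and that cosine similarity is the metric you want. The names `cosine_similarity`, `top_k_similar`, `stored_embeddings`, and `new_embedding` are made up for illustration, and the toy 2-dimensional vectors stand in for real model output.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    # Treat a zero vector as having no similarity to anything.
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k_similar(query, stored, k=3):
    """Return up to k (index, score) pairs, best match first."""
    scored = [(i, cosine_similarity(query, emb)) for i, emb in enumerate(stored)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy data (assumption): in practice these come from the embedding model.
stored_embeddings = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
new_embedding = [1.0, 0.0]

matches = top_k_similar(new_embedding, stored_embeddings, k=2)
for index, score in matches:
    print(index, round(score, 4))
```

This scans every stored embedding on each query, so it only makes sense for modest collections. For larger ones, a vector database or an approximate nearest-neighbour index is likely the better fit.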
