Hosted API

#3
by tiagofreitas87 - opened

Is anyone running an API for embedding? Otherwise what is the best host for a serverless api to do embeddings?
Thanks

NLP Group of The University of Hong Kong org

Hi, thanks for your interest in the INSTRUCTOR model!

One good way to run the INSTRUCTOR model without a local GPU is to compute embeddings in Google Colab. Here is an example script: https://colab.research.google.com/drive/1P7ivNLMosHyG7XOHmoh7CoqpXryKy3Qt?usp=sharing
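Once you have embeddings (for instance from the Colab script above, via the model's `encode` call), ranking documents against a query is just cosine similarity. A minimal NumPy sketch, assuming the embeddings are already computed as arrays; the `top_k` helper name is mine, not from the notebook:

```python
import numpy as np

def top_k(query_emb, doc_embs, k=3):
    """Return indices of the k document embeddings most similar
    to the query embedding, by cosine similarity."""
    # Normalize so the dot product equals cosine similarity.
    q = query_emb / np.linalg.norm(query_emb)
    d = doc_embs / np.linalg.norm(doc_embs, axis=1, keepdims=True)
    sims = d @ q
    # Sort descending and keep the top k indices.
    return np.argsort(-sims)[:k]
```

With INSTRUCTOR you would pass `query_emb = model.encode([[instruction, query]])[0]` and stack the document embeddings into `doc_embs`; the ranking step itself needs no GPU.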

Hope this helps! Feel free to add further questions or comments!

I need a serverless API (pay-per-second), as it's not worth paying for a full GPU for now.

NLP Group of The University of Hong Kong org

Hi, thanks a lot for your question!

The Colab service is free, so you can try the INSTRUCTOR model there without paying for a GPU. Currently, we do not provide a hosted API for computing embeddings.


@tiagofreitas87 maybe this is relevant for you https://embaas.io

Thanks, but the Discord invite on that page isn't working, and MosaicML just launched an inference service that offers Instructor embeddings.
https://www.mosaicml.com/inference

They are more established, so it would be difficult to compete with them, unless embaas or another service provides cheaper/easier continuous fine-tuning for a specific domain; though there are privacy concerns for enterprises.

Disclaimer: I am also working on embaas.

Thank you for the hint. We fixed the invitation. And it's cool that MosaicML has just added Instructor. We are currently working on fine-tuning the model on other languages. :)

@tiagofreitas87 If you're up for the challenge and latency is not an issue for you, you could try https://github.com/maxsagt/lambda-instructor which lets you deploy Instructor-Large on an AWS Lambda. It runs at about 6 seconds per request and costs less than $0.001 per request (depending on setup).
