Inference Endpoints Performance
#2 · opened by philschmid (HF staff)
Hey, Philipp here,
Congrats on the new model release, it looks awesome! I'm curious, though, how you produced the benchmark numbers for Inference Endpoints, since the model seems to require `trust_remote_code` and there is no handler.py to deploy it with.
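For reference, I'd expect a custom handler roughly along these lines (just a sketch following the usual `EndpointHandler` convention for Inference Endpoints; the dtype and generation settings are assumptions on my part):

```python
# handler.py -- minimal custom handler sketch for Inference Endpoints.
# The EndpointHandler class with __init__/__call__ is the convention the
# Endpoints runtime looks for; dtype and generation settings are placeholders.
from typing import Any, Dict, List

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer


class EndpointHandler:
    def __init__(self, path: str = ""):
        # `path` points at the repository contents on the endpoint.
        self.tokenizer = AutoTokenizer.from_pretrained(path, trust_remote_code=True)
        self.model = AutoModelForCausalLM.from_pretrained(
            path,
            trust_remote_code=True,
            torch_dtype=torch.float16,
            device_map="auto",
        )

    def __call__(self, data: Dict[str, Any]) -> List[Dict[str, Any]]:
        prompt = data.get("inputs", "")
        tokens = self.tokenizer(prompt, return_tensors="pt").to(self.model.device)
        with torch.inference_mode():
            output = self.model.generate(**tokens, max_new_tokens=128)
        return [
            {"generated_text": self.tokenizer.decode(output[0], skip_special_tokens=True)}
        ]
```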
Hey Philipp!
Apologies for the oversight. The baseline numbers in the README refer to benchmarking results without a managed solution, just HF Transformers + PyTorch, not HF Inference Endpoints. We've updated the README to clarify this. Thanks for pointing it out!
Also, a heartfelt thank you for all your open-source contributions!
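For context, the measurement was a plain HF + PyTorch setup, roughly along these lines (a rough sketch rather than the exact script; the model id, prompt, and generation settings below are placeholders):

```python
# Rough sketch of a plain HF + PyTorch latency measurement (no managed
# inference service). Model id, prompt, and generation settings are placeholders.
import time

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "org/model-name"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    trust_remote_code=True,
    torch_dtype=torch.float16,
    device_map="auto",
)

prompt = "Hello, world!"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# Warm-up pass, then time several generations and report the mean latency.
with torch.inference_mode():
    model.generate(**inputs, max_new_tokens=32)

latencies = []
for _ in range(10):
    start = time.perf_counter()
    with torch.inference_mode():
        model.generate(**inputs, max_new_tokens=32)
    latencies.append(time.perf_counter() - start)

print(f"mean latency: {sum(latencies) / len(latencies):.3f}s")
```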
itay-levy changed discussion status to closed