Access 🤗 Inference Endpoints

To access the Inference Endpoints web application, you or your organization need to add a valid payment method to your Hugging Face account.

You can check your [billing](https://huggingface.co/settings/billing) if you're unsure whether you have an active payment method.

After you’ve added a valid payment method to your account, access the Inference Endpoints web application and start deploying! 🥳

There are a few pricing plans:

Inference Endpoints pricing is pay-as-you-go based on your hourly compute, number of replicas, and billed monthly. This can be as low as $0.032 per CPU core/hr and $0.5 per GPU/hr depending on your needs.
Inference Endpoints Enterprise plan which offers dedicated support, 24/7 SLAs, and uptime guarantees. Pricing for Enterprise is custom and based on volume commit and annual contracts; contact us for a quote!

Don’t forget to subscribe to PRO and/or your organization to Enterprise Hub for a variety of premium features and priority support 🚀 Sign up for PRO here.

We also have Enterprise Hub contract-based invoicing available, which allows for more payment options + prepaid compute credits. You can request a quote in your org’s billing settings. Sign up your organization for Enterprise Hub here.

< > Update on GitHub

Inference Endpoints (dedicated)

Access 🤗 Inference Endpoints