Inference Endpoints (dedicated)

Join the Hugging Face community

and get access to the augmented documentation experience

Collaborate on models, datasets and Spaces

Faster examples with accelerated inference

Switch between documentation themes

to get started

API Reference (Swagger)

Inference Endpoints can be used through the UI and programmatically through an API. Here you’ll find the open-API specification for each available route, which you can call directly, or through the Hugging Face Hub python client.

Update on GitHub

←About Inference Endpoints Deploy your own chat application→