Does it support server mode?

by yinkaisheng - opened 1 day ago

Hi, I noticed that the current README only shows a CLI usage example (locate-anything-cli detect ...).
I was wondering if there is any support for running this model through a llama-server / OpenAI-compatible HTTP server (similar to llama.cpp server mode)?
Or is the CLI the only supported inference interface at the moment?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment