Interact with AI using text, images, or audio
OmniParser: turn your LLM into a GUI agent
An end-to-end (e2e) Voice Language Model by Fish Audio.
Improved version with support for 1107+ languages
Generate structured output using prompts