Generate images from text prompts
Extend images using prompts and alignment options
An end-to-end (e2e) Voice Language Model by Fish Audio.
Detect objects in images with Transformers.js