An end-to-end (e2e) Voice Language Model by Fish Audio.
Upgraded to v1.0!
Extract garment images from everyday images!
Identity-Preserving Text-to-Video Generation
Image generator/identifier/reposer