F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Fast Inversion of Rectified Flow for Image Semantic Editing.
FLUX 4-bit Quantization(just 8GB VRAM)