Transcribe speech to text
Generate images with Stable Diffusion 3.5 (SD3.5)
Voice conversion framework based on VITS
Convert audio to different voices