Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
HKUST Audio's picture
14 6 12

HKUST Audio PRO

HKUST-Audio
xfzhu's profile picture Zeyue7's profile picture lmxue's profile picture
·
  • wxue_audio

AI & ML interests

Audio Generation

Organizations

HKUST Audio's profile picture

Collections 1

Our AK Daily Papers
  • CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model

    Paper • 2305.06908 • Published May 11, 2023 • 6
  • CoMoSVC: Consistency Model-based Singing Voice Conversion

    Paper • 2401.01792 • Published Jan 3, 2024 • 11
  • ChatMusician: Understanding and Generating Music Intrinsically with LLM

    Paper • 2402.16153 • Published Feb 25, 2024 • 61
  • FlashSpeech: Efficient Zero-Shot Speech Synthesis

    Paper • 2404.14700 • Published Apr 23, 2024 • 33

Papers 6

arxiv:2504.14906
arxiv:2503.08638
arxiv:2503.01710
arxiv:2502.05979

spaces 2

Running on Zero
12

Llasa 1B Multi Speakers Genshin Zh En Ja Ko

🚀

Llasa-1B-Multilingual finetuned using simon3000/genshin-voic

Feb 13
Running on Zero
8

Llasa 1B Finetuned For Two Speakers

🔥

Using dataset shb777/gemini-flash-2.0-speech for finetuning

Feb 13

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs