Yiwei Guo

cantabile-kwok

AI & ML interests

Text to Speech

Recent Activity

upvoted a paper about 1 month ago

SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

updated a Space 5 months ago

cantabile-kwok/vec2wav2.0-demo

updated a dataset 6 months ago

cantabile-kwok/libritts-all-kaldi-data

cantabile-kwok's activity

upvoted a paper about 1 month ago

SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

Paper • 2503.08625 • Published Mar 11 • 26

updated a Space 5 months ago

Vec2wav2.0 Demo

vec2wav 2.0, a speech token vocoder for VC. arXiv 2409.01995

updated a dataset 6 months ago

cantabile-kwok/libritts-all-kaldi-data

Updated Nov 6, 2024 • 24

New activity in novateur/WavTokenizer-large-unify-40token 6 months ago

will the config be uploaded for it?

#1 opened 6 months ago by breadlicker45

updated a model 6 months ago

cantabile-kwok/vec2wav2.0

Updated Oct 26, 2024 • 2

upvoted a paper 6 months ago

MobA: A Two-Level Agent System for Efficient Mobile Task Automation

Paper • 2410.13757 • Published Oct 17, 2024 • 33

liked a model 9 months ago

facebook/wav2vec2-xlsr-53-espeak-cv-ft

Automatic Speech Recognition • Updated Dec 10, 2021 • 347k • 31

updated a model over 1 year ago

X-LANCE/ctx_vec2wav_libritts_all

Updated Nov 21, 2023

liked a model over 1 year ago

facebook/mms-tts-hne

Text-to-Speech • Updated Sep 1, 2023 • 4 • 1

updated 2 models over 1 year ago

cantabile-kwok/hifigan-libritts-800-200

Updated Oct 8, 2023

cantabile-kwok/hifigan-ljspeech-1024-256

Updated Oct 8, 2023

updated a dataset over 1 year ago

cantabile-kwok/ljspeech-1024-256-dur

Updated Oct 8, 2023 • 26