clvp_dev / README.md
susnato's picture
Create README.md
8819783
|
raw
history blame
661 Bytes

DISCLAIMER : I do not own any weights present in this repository. All weights belong to the author of the paper - "Better speech synthesis through scaling", James Betker . I am storing the weights(temporarily) for the tortoise-tts integration to Huggingface. Please refer to this PR to know more.

About

CLVP model is an integral part of tortoise-tts presented in the paper - "Better speech synthesis through scaling" by James Betker. CLVP uses an architecture similar to the CLIP text encoder, except it uses two of them: one for text tokens and the other for MEL tokens.