README.md · Snowad/French-Tortoise at 9837e38c84a724a3ec5d1241737b8d2c1ea60334

metadata

license: apache-2.0
language:
  - fr
pipeline_tag: text-to-speech
tags:
  - TTS
  - text-to-speech

V2.5 Model : Comming Soon™ Fine tune of my V2 model on all CommonVoice dataset, it made the voice clone better from the V2 model, I will train it more in the next days

V2 Model :

Tortoise base model Fine tuned on a custom multispeaker French dataset of 120k samples (SIWIS + Common Voice subset + M-AILABS) on 10k step with a RTX 3090 (~= 21 hours of training), with Text LR Weight at 1 Result : The model can speak French much better without an English accent but the voice clone hardly works

V1 Model :

Tortoise base model Fine tuned on a custom multispeaker French dataset of 24k samples (SIWIS + Common Voice subset) on 8850 step with a RTX 3090 (~= 19 hours of training)

Inference :

You can use the model by downloading the "V2_9750_gpt.pth" model and use it in the tortoise-tts optimized forks (git.ecker.tech/mrq/ai-voice-cloning | 152334H/tortoise-tts-fast)

Fine tuning :

I used 152334H/DL-Art-School for training, if you want to resume training from my epoch, follow its documentation and download "V2_9750.state"