Text-to-Speech
F5-TTS
Hindi
hindi
tts
fine-tuned

F5-TTS Hindi

Fine-tuned F5-TTS model for Hindi text-to-speech.

Training Details

  • Base model: SPRINGLab/F5-Hindi-24KHz (151M params, F5-TTS Small)
  • Datasets: somu9/iisc_mono_hindi_female (Studio-quality single-speaker Hindi female TTS dataset from the SYSPIN project by Indian Institute of Science (IISc), Bengaluru)
  • Learning rate: 1e-05
  • Epochs: 3
  • Batch size: 200 frames/GPU
  • Architecture: DiT (dim=768, depth=18, heads=12) + ConvNeXt V2 (dim=512, layers=4)
  • Audio: 24kHz, 100-dim mel spectrogram, Vocos vocoder

Usage

from f5_tts.api import F5TTS
model = F5TTS(
    model_type="F5-TTS",
    ckpt_file="path/to/model_last.pt",  
    vocab_file="path/to/vocab.txt",
)
wav, sr, _ = model.infer(
    ref_file="reference.wav",
    ref_text="संदर्भ पाठ हिंदी में",
    gen_text="आपका दिन शुभ हो",
)
Downloads last month
39
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for SwarajSolanke-turtle/F5TTS_female_trained

Finetuned
(3)
this model

Dataset used to train SwarajSolanke-turtle/F5TTS_female_trained