How to use Marcent/SpeechT5_finetune_TTS with Transformers:
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-to-audio", model="Marcent/SpeechT5_finetune_TTS")
```

```python
# Load model directly
from transformers import AutoProcessor, AutoModelForTextToSpectrogram

processor = AutoProcessor.from_pretrained("Marcent/SpeechT5_finetune_TTS")
model = AutoModelForTextToSpectrogram.from_pretrained("Marcent/SpeechT5_finetune_TTS")
```

This model is a fine-tuned version of microsoft/speecht5_tts on the lj_speech dataset. It achieves the following results on the evaluation set:
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.4685 | 0.3053 | 100 | 0.4252 |
| 0.4461 | 0.6107 | 200 | 0.4007 |
| 0.4316 | 0.9160 | 300 | 0.3924 |
| 0.4197 | 1.2198 | 400 | 0.3856 |
| 0.4175 | 1.5252 | 500 | 0.3825 |
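
The table can be read more easily with a little arithmetic. The following is a minimal sketch, not part of the original training code: it copies the five logged rows by hand and computes the drop in validation loss between consecutive checkpoints, plus the steps-per-epoch ratio implied by the Epoch and Step columns. The names `rows`, `validation_deltas`, and `steps_per_epoch` are illustrative assumptions.

```python
# Rows copied by hand from the table above:
# (training loss, epoch, step, validation loss)
rows = [
    (0.4685, 0.3053, 100, 0.4252),
    (0.4461, 0.6107, 200, 0.4007),
    (0.4316, 0.9160, 300, 0.3924),
    (0.4197, 1.2198, 400, 0.3856),
    (0.4175, 1.5252, 500, 0.3825),
]


def validation_deltas(rows):
    """Return the drop in validation loss between consecutive checkpoints."""
    return [round(prev[3] - cur[3], 4) for prev, cur in zip(rows, rows[1:])]


def steps_per_epoch(rows):
    """Estimate optimizer steps per epoch from each logged (epoch, step) pair."""
    return [step / epoch for _, epoch, step, _ in rows]


deltas = validation_deltas(rows)
ratios = steps_per_epoch(rows)

for (_, epoch, step, val_loss), ratio in zip(rows, ratios):
    print(f"step {step:>3}  epoch {epoch:.4f}  val loss {val_loss:.4f}  steps/epoch ~{ratio:.1f}")
print("validation loss drops:", deltas)
```

The per-checkpoint drops shrink from 0.0245 to 0.0031, which suggests the validation loss was flattening by step 500, and every logged row implies between 327 and 328 steps per epoch.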
Base model: microsoft/speecht5_tts