請問繁中的語音模型可以怎樣微調？

by doggy8088 - opened Oct 29, 2024

Oct 29, 2024

我想請問繁中的語音模型可以怎樣微調？不知道是否可以提供一些參考資源，謝謝！

Owner Oct 29, 2024

我目前是用自己寫的工具 wft 去微調，同時參考 HuggingFace 的文章。

這個模型訓練租用的雲端 GPU 成本大概 3 USD（我在 RunPod 上面租 A40 跑了 7 小時）。

btw Common Voice zh-TW subset 裡 train+validation的錄音好像其實也不多，也許還不到 20 小時；我還在看有沒有其他公開的資料集能用。

Oct 29, 2024

謝謝你的回覆！😊

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment