CLI

WIP

Dataset

Dataset.bat webui (python webui_dataset.py) consists of slice audio and transcribe wavs.

python slice.py -i <input_dir> -o <output_dir> -m <min_sec> -M <max_sec>

Required:

Optional:

python transcribe.py -i <input_dir> -o <output_file> --speaker_name <speaker_name>

Required:

Optional

--initial_prompt: Initial prompt to use for the transcription (default value is specific to Japanese).
--device: cuda or cpu (default: cuda).
--language: jp, en, or en (default: jp).
--model: Whisper model, default: large-v3
--compute_type: default: bfloat16

Train.bat webui (python webui_train.py) consists of the following.

python resample.py -i <input_dir> -o <output_dir> [--normalize] [--trim]

Required:

input_dir: Path to the directory containing the audio files to preprocess.
output_dir: Path to the directory where the preprocessed audio files will be saved.

TO BE WRITTEN (WIP)

これいる？