royshilkrot
/

whisper-large-v3-turbo-korean-ggml

Automatic Speech Recognition

Inference Endpoints

Model card Files Files and versions Community

whisper-large-v3-turbo-korean-ggml / README.md

royshilkrot's picture

Update README.md

8e89cc4 verified 3 months ago

|

history blame contribute delete

669 Bytes

	---
	license: apache-2.0
	datasets:
	- Junhoee/STT_Korean_Dataset_80000
	- Bingsu/zeroth-korean
	language:
	- ko
	metrics:
	- wer
	base_model:
	- openai/whisper-large-v3-turbo
	pipeline_tag: automatic-speech-recognition
	library_name: transformers
	---
	This model is a fine-tune of OpenAI's Whisper Large v3 Turbo model (https://huggingface.co/openai/whisper-large-v3-turbo) over the following Korean datasets:

	https://huggingface.co/datasets/Junhoee/STT_Korean_Dataset_80000
	https://huggingface.co/datasets/Bingsu/zeroth-korean
	Combined they have roughly 102k sentences.

	This is the last checkpoint which has achieved ~16 WER (down from ~24 WER).

	Training was 10,000 iterations.