mitchelldehaven
/

whisper-medium-ru

Automatic Speech Recognition Transformers PyTorch whisper whisper-event Eval Results Inference Endpoints

Model card Files Files and versions Community

whisper-medium-ru / README.md

mitchelldehaven's picture

mitchelldehaven

Update README.md

df50cf8 over 1 year ago

|

raw history blame contribute delete

No virus

707 Bytes

	---
	model-index:
	- name: whisper-medium-ru
	results:
	- task:
	type: automatic-speech-recognition
	name: Automatic Speech Recognition
	dataset:
	name: mozilla-foundation/common_voice_11_0
	type: mozilla-foundation/common_voice_11_0
	config: ru
	split: test
	metrics:
	- type: wer
	value: 9.65
	name: WER
	tags:
	- whisper-event
	---

	Whisper model finetuned using audio data from Open STT Russian Dataset (https://github.com/snakers4/open_stt).

	There is a differences in tokenization of source data (in our data normalization process, we replace punctucation with "" rather than Whisper's " "). This mismatch leads to a slight degradation on CommonVoice.