spow12 committed on
Commit
07ba25b
1 Parent(s): 5c25512

Create README.md

Files changed (1)
  1. README.md +49 -0
README.md ADDED
@@ -0,0 +1,49 @@
+ ---
+ license: apache-2.0
+ datasets:
+ - Murple/ksponspeech
+ language:
+ - ko
+ metrics:
+ - cer
+ - wer
+ pipeline_tag: automatic-speech-recognition
+ ---
+ # Whisper-Medium-KsponSpeech
+
+ A Whisper-medium model fine-tuned on [KsponSpeech](https://huggingface.co/datasets/Murple/ksponspeech) for Korean automatic speech recognition.
+
+ ### Model Description
+
+ - **Developed by:** [yw0nam](https://github.com/yw0nam)
+ - **Shared by:** [yw0nam](https://github.com/yw0nam)
+ - **Model type:** Automatic speech recognition (ASR)
+ - **License:** apache-2.0
+
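+ ## Uses
+
+ ```python
+ import librosa
+ from transformers import WhisperForConditionalGeneration, WhisperProcessor
+
+ wav_path = "sample.wav"  # placeholder: path to your own audio file
+
+ # The processor (feature extractor + tokenizer) comes from the base model;
+ # the weights come from the fine-tuned checkpoint. `.cuda()` requires a GPU.
+ processor = WhisperProcessor.from_pretrained("openai/whisper-medium", language="ko", task="transcribe")
+ model = WhisperForConditionalGeneration.from_pretrained('spow12/whisper-medium-zeroth_korean').cuda()
+
+ # Whisper expects 16 kHz audio, so resample while loading.
+ data, _ = librosa.load(wav_path, sr=16000)
+ input_features = processor(data, sampling_rate=16000, return_tensors="pt").input_features.cuda()
+
+ # Generate token IDs and decode them to text.
+ predicted_ids = model.generate(input_features)
+ transcription = processor.batch_decode(predicted_ids, skip_special_tokens=True)[0]
+ ```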
+
+ ### Metrics
+
+ | Metric | Result (%) |
+ | --- | --- |
+ | WER | 3.96 |
+ | CER | 1.71 |
+
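WER and CER are both edit-distance ratios: the number of substitutions, insertions, and deletions needed to turn the hypothesis into the reference, divided by the reference length in words or characters. The README does not show the evaluation script, so the following is only a minimal sketch of how such scores could be computed in plain Python. The function names `edit_distance`, `wer`, and `cer` are hypothetical, and counting spaces as characters in `cer` is an assumption of this sketch, not necessarily the convention behind the reported numbers.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists)."""
    prev = list(range(len(hyp) + 1))  # distances from an empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # turning i reference items into an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate in percent, using whitespace tokenization."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return 100.0 * edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate in percent (assumption: spaces count as characters)."""
    if not reference:
        raise ValueError("reference must not be empty")
    return 100.0 * edit_distance(reference, hypothesis) / len(reference)
```

To score a test set, one would typically sum the edit distances and the reference lengths over all utterances before dividing, rather than averaging per-utterance rates.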