BatiSay-ko-base
νκ΅μ΄ fine-tuned Whisper Large v3 Turbo β Apache 2.0.
CER (KsponSpeech eval_clean 3000 sample)
| Model | CER |
|---|---|
| OpenAI Whisper Large v3 (raw) | 17.03% |
| batisay-ko-base | 8.68% |
| Return Zero (ν보) | 5.91-6.18% |
νμΌ
ggml-batisay-ko-base.bin 1.6 GB (F32, μ΅κ³ quality)
ggml-batisay-ko-base-q5_0.bin 547 MB (Q5, balanced) recommended
ggml-batisay-ko-base-q4_0.bin 452 MB (Q4, Mac 8GB)
model.safetensors 1.6 GB (transformers)
μ¬μ© β whisper.cpp / BatiFlow App
WhisperModel(
name: "BatiSay-ko-base",
url: "https://huggingface.co/batiai/batisay-ko-base/resolve/main/ggml-batisay-ko-base-q5_0.bin"
)
μ¬μ© β Python
from transformers import WhisperForConditionalGeneration, WhisperProcessor
model = WhisperForConditionalGeneration.from_pretrained('batiai/batisay-ko-base')
processor = WhisperProcessor.from_pretrained('batiai/batisay-ko-base', language='Korean', task='transcribe')
νμ΅
- Base: openai/whisper-large-v3-turbo (809M params)
- Data: KsponSpeech 1000h + Zeroth-Korean 50h
- Epoch: 3, LR 1e-5 linear, 2 GPU DDP (A6000 48GB)
- Train time: 35.8h
- νμ΅ μμ : 2026-05-28
λΌμ΄μΌμ€
Apache 2.0 (BatiAI Open Tier 1)
ν₯ν
- batisay-ko-large (V10): 5-6% CER, 2026-06-11 μΆμ μμ
- batisay-ko-base-v11: 7-7.5% CER, V11 λ°μ΄ν° κ°ν, 2026-06-13 μΆμ μμ
- Downloads last month
- 180
Inference Providers NEW
This model isn't deployed by any Inference Provider. π Ask for provider support
Model tree for batiai/batisay-ko-base
Base model
openai/whisper-large-v3 Finetuned
openai/whisper-large-v3-turbo