BatiSay-ko-base

ν•œκ΅­μ–΄ fine-tuned Whisper Large v3 Turbo β€” Apache 2.0.

CER (KsponSpeech eval_clean 3000 sample)

Model CER
OpenAI Whisper Large v3 (raw) 17.03%
batisay-ko-base 8.68%
Return Zero (홍보) 5.91-6.18%

파일

ggml-batisay-ko-base.bin         1.6 GB  (F32, 졜고 quality)
ggml-batisay-ko-base-q5_0.bin    547 MB  (Q5, balanced) recommended
ggml-batisay-ko-base-q4_0.bin    452 MB  (Q4, Mac 8GB)
model.safetensors                1.6 GB  (transformers)

μ‚¬μš© β€” whisper.cpp / BatiFlow App

WhisperModel(
  name: "BatiSay-ko-base",
  url: "https://huggingface.co/batiai/batisay-ko-base/resolve/main/ggml-batisay-ko-base-q5_0.bin"
)

μ‚¬μš© β€” Python

from transformers import WhisperForConditionalGeneration, WhisperProcessor
model = WhisperForConditionalGeneration.from_pretrained('batiai/batisay-ko-base')
processor = WhisperProcessor.from_pretrained('batiai/batisay-ko-base', language='Korean', task='transcribe')

ν•™μŠ΅

  • Base: openai/whisper-large-v3-turbo (809M params)
  • Data: KsponSpeech 1000h + Zeroth-Korean 50h
  • Epoch: 3, LR 1e-5 linear, 2 GPU DDP (A6000 48GB)
  • Train time: 35.8h
  • ν•™μŠ΅ μ‹œμ : 2026-05-28

λΌμ΄μ„ΌμŠ€

Apache 2.0 (BatiAI Open Tier 1)

ν–₯ν›„

  • batisay-ko-large (V10): 5-6% CER, 2026-06-11 μΆœμ‹œ μ˜ˆμ •
  • batisay-ko-base-v11: 7-7.5% CER, V11 데이터 κ°•ν™”, 2026-06-13 μΆœμ‹œ μ˜ˆμ •
Downloads last month
180
Safetensors
Model size
0.8B params
Tensor type
BF16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for batiai/batisay-ko-base

Finetuned
(539)
this model