whisper-small-setswana (ONNX for Transformers.js)

ONNX export (q8 / int8 quantized) of misterkissi/whisper-small-setswana, packaged for in-browser speech recognition with ๐Ÿค— Transformers.js.

This lets a Setswana (Tswana) Whisper model run entirely client-side in the browser (WebGPU with WASM fallback) โ€” no server, no audio leaving the device.

Files

  • onnx/encoder_model_quantized.onnx (~88 MB)
  • onnx/decoder_model_merged_quantized.onnx (~186 MB)
  • tokenizer / preprocessor / config JSON

Usage

import { pipeline } from '@huggingface/transformers';

const asr = await pipeline(
  'automatic-speech-recognition',
  'Hydramus/whisper-small-setswana-onnx',
  { device: 'webgpu', dtype: 'q8' }
);
const { text } = await asr(float32AudioAt16kHz);

Attribution & license

Quality note

The base fine-tune reports WER โ‰ˆ 0.48 with signs of overfitting, so transcripts are a useful rough draft for Setswana but not production-clean. Quantization to int8 preserves the original behaviour (verified: outputs match the PyTorch model up to minor rounding).

Downloads last month
31
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Hydramus/whisper-small-setswana-onnx

Quantized
(1)
this model