Whisper-tiny MyST (GC-LoRA)

Whisper-tiny adapted to the MyST dataset with GC-LoRA (Gated Convolutional LoRA), from the paper "GC-LoRA: Gated Convolutional LoRA for Parameter-Efficient Acoustic Adaptation" (Interspeech 2026).

  • Base model: openai/whisper-tiny.en
  • Method: GC-LoRA adapter on the encoder attention output projections (rank 8, kernel 31, scaling 16)
  • MyST test WER: 15.7%
  • Code: https://github.com/balaji1312/gc_lora

Usage

The checkpoint bundles the frozen Whisper backbone together with the trained GC-LoRA adapter. Loading requires the custom modeling code in the gc_lora repository; see src/bin/decode_asr.py there for loading and decoding.

Citation

@inproceedings{shankar2026gclora,
  author    = {Shankar, Natarajan Balaji and Wang, Zilai and Zhang, Kaiyuan and Shi, Mohan and Alwan, Abeer},
  title     = {{GC-LoRA}: Gated Convolutional {LoRA} for Parameter-Efficient Acoustic Adaptation},
  booktitle = {Interspeech 2026},
  year      = {2026},
}
Downloads last month
13
Safetensors
Model size
37.8M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support