File size: 1,001 Bytes
8f0745b |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 |
---
license: apache-2.0
language:
- hi
pipeline_tag: automatic-speech-recognition
---
int8 quantized [ctranslate2](https://github.com/OpenNMT/CTranslate2)-compatible version of [vasista22/whisper-hindi-large-v2](https://huggingface.co/vasista22/whisper-hindi-large-v2).
This means the 5.7GB model is compressed into 1.6GB :).
Model created using
```
ct2-transformers-converter --model /path/to/vasista22/whisper-hindi-large-v2 --output_dir whisper-hindi-large-v2-ct2-int8 --copy_files tokenizer_config.json preprocessor_config.json added_tokens.json special_tokens_map.json --quantization int8
```
For monospeaker audio, use either of
1. [ctranslate2](https://github.com/OpenNMT/CTranslate2)
2. [faster-whisper](https://github.com/SYSTRAN/faster-whisper)
For multispeaker audio with english diarization, use [whisperX](https://github.com/m-bain/whisperX/).
For multispeaker audio with non-english diarization, use [whisper-diarization](https://github.com/MahmoudAshraf97/whisper-diarization/).
|