runtime error

B/s] model.bin: 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 231M/484M [00:07<00:09, 26.1MB/s] model.bin: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 241M/484M [00:07<00:08, 27.0MB/s] model.bin: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 262M/484M [00:07<00:07, 31.1MB/s] model.bin: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 273M/484M [00:08<00:06, 31.8MB/s] model.bin: 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 283M/484M [00:09<00:09, 22.2MB/s] model.bin: 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 294M/484M [00:09<00:07, 25.6MB/s] model.bin: 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 304M/484M [00:09<00:07, 25.5MB/s] model.bin: 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 325M/484M [00:10<00:04, 31.9MB/s] model.bin: 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 336M/484M [00:10<00:04, 33.5MB/s] model.bin: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 346M/484M [00:10<00:03, 38.1MB/s] model.bin: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 357M/484M [00:11<00:03, 34.5MB/s] model.bin: 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 377M/484M [00:11<00:02, 37.0MB/s] model.bin: 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 388M/484M [00:12<00:03, 28.8MB/s] model.bin: 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 409M/484M [00:12<00:02, 32.6MB/s] model.bin: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 419M/484M [00:13<00:02, 27.0MB/s] model.bin: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 430M/484M [00:13<00:01, 32.5MB/s] model.bin: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 440M/484M [00:13<00:01, 31.8MB/s] model.bin: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 451M/484M [00:14<00:01, 26.9MB/s] model.bin: 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 472M/484M [00:14<00:00, 35.2MB/s] model.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 482M/484M [00:15<00:00, 25.8MB/s] model.bin: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 484M/484M [00:15<00:00, 31.2MB/s] Traceback (most recent call last): File "/home/user/app/app.py", line 34, in <module> model = WhisperModel(model_size, device="cuda", compute_type="float16") File "/home/user/.local/lib/python3.10/site-packages/faster_whisper/transcribe.py", line 128, in __init__ self.model = ctranslate2.models.Whisper( RuntimeError: CUDA failed with error CUDA driver version is insufficient for CUDA runtime version

Container logs:

Fetching error logs...