YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co/docs/hub/model-cards#model-card-metadata)
TensorRT-LLM optimized Whisper model
This repository contains a TensorRT-LLM optimized version of the Whisper model from jharshraj/whisper-indian-names.
Optimization details
- Precision: float16
- Weight quantization: int8
- Max batch size: 8
- Max beam width: 4
Usage
To use this model, you need TensorRT-LLM installed. Please refer to the TensorRT-LLM Whisper documentation for usage instructions.
- Downloads last month
- 1
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
HF Inference deployability: The model has no library tag.