Compiled engines for running Whisper with TensorRT-LLM for much faster inference.
baseten • company • Verified
Collections: 1
Models: 655
baseten/BertForSequenceClassificationTesting • 365
baseten/smol_llama-101M-GQAForSequenceClassification • Text Classification • 80
baseten/RandomQwen2ForSequenceClassification-0.5B • Text Classification • 60
baseten/whisper_trt_large_v3_testrilla_post1_NVIDIA_L4_0_13_0
baseten/whisper_trt_crisper_whisper_test_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_13_0
baseten/btest-TinyLlama-1.1B-Chat-v1.0-NVIDIA-H100-80GB-HBM3-v0.16.0-TP2 • 6
baseten/btest-TinyLlama-1.1B-Chat-v1.0-NVIDIA-H100-80GB-HBM3-v0.16.0-TP1 • 9
baseten/whisper_trt_large_v3_turbo_test_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_13_0
baseten/whisper_trt_large_v3_testrilla_NVIDIA_L4_0_13_0
baseten/whisper_trt_large_v3_turbo_test_NVIDIA_H100_80GB_HBM3_0_13_0