Automatic Speech Recognition
Transformers
Safetensors
PyTorch
arkasr
text-generation
speech
audio
vllm
ark-asr
custom_code
Eval Results
Instructions to use AutoArk-AI/ARK-ASR-3B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use AutoArk-AI/ARK-ASR-3B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="AutoArk-AI/ARK-ASR-3B", trust_remote_code=True)# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("AutoArk-AI/ARK-ASR-3B", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Transcribing Long Audio Files
#2
by HBotond - opened
Hi!
What is the recommended approach for transcribing long audio files (around 30 minutes)? Can the model process the entire recording in a single request, or is it better to split the audio into smaller segments? If chunking is recommended, what segment length do you suggest?
Thanks in advance for your help!
Just make sure each segment is no longer than 30 seconds.