Spaces:
Sleeping
Sleeping
metadata
title: Busy Module Audio Features
emoji: 🎤
colorFrom: indigo
colorTo: purple
sdk: docker
app_port: 7860
pinned: false
Busy Module Audio Features
Audio Feature Extraction API
This Space extracts 17 voice features from audio, including SNR, noise classification, speech rate, pitch, energy, pause analysis, and emotion features.
API
POST /extract-audio-features-base64
{
"audio_base64": "<base64-encoded-wav>",
"transcript": "I'm driving right now"
}
POST /extract-audio-features (multipart form)
audio: audio file uploadtranscript: text transcript
GET /health
Authentication
This Space requires access to private models. Add your Hugging Face token as a secret:
- Go to Settings -> Variables and secrets.
- Click New secret.
- Name it
HF_TOKEN. - Set the value to a Hugging Face access token with read permissions.