Spaces:
Running
Running
metadata
title: Fluency Benchmark
emoji: ποΈ
colorFrom: blue
colorTo: green
sdk: docker
app_port: 7860
app_file: app.py
pinned: false
Fluency Benchmark App
Upload an English speech audio file to measure temporal fluency β flow, continuity, and pausing behavior.
What It Measures
- Temporal fluency β flow and continuity of speech
- Pause behavior β frequency, duration, and placement of pauses
- Articulation β smoothness of delivery (legato vs staccato)
- Hesitation diagnosis β cognitive load and utterance constraints
What it does NOT measure: grammar, vocabulary, pronunciation, or accent.
Pipeline Stages
- VAD (Silero) β detects speech vs silence, computes 6 temporal features
- Transcription (WhisperX base) β word-level aligned transcript
- Placement β classifies each pause as boundary-aligned or mid-clause
- FA Features β word duration, confidence, filled pauses, speech rate CV
- Syntactic β POS-tags pauses as before content vs function words
- Inference β 6 ordinal models + 1 dominance model β predictions
- Composite β 6 dimensions weighted β percentile + confidence interval
Output
- Fluency Percentile (0-100, relative to benchmark population of 917 speakers)
- Fluency Band (LOW / MEDIUM / HIGH)
- 6 Dimension Scores (Continuity, Pause Quality, Placement, Articulation, Dominance, Word Precision)
- Ordinal Predictions (Pause Frequency, Duration, Placement, Cognitive Load, Utterance Constraints, Articulation)
- 95% Confidence Interval (Dirichlet bootstrap)