Parakeet TDT 0.6B V2 - OpenVINO

Discord GitHub Repo stars

OpenVINO-optimized version of NVIDIA's Parakeet TDT 0.6B V2 model for high-performance automatic speech recognition on Intel NPUs and CPUs.

Benchmark Results

Hardware: Intel Core Ultra 7 155H (Meteor Lake) with Intel AI Boost NPU Dataset: LibriSpeech test-clean (2,620 files, 5.4 hours) Software: OpenVINO 2025.x

Metric Value
Average WER 2.87%
Median WER 0.00%
Average CER 1.07%
RTFx (NPU) 37.8×
RTFx (CPU) 5-8×
Total processing time 514.7s

Performance Comparison

Implementation Device RTFx
eddy (OpenVINO) Intel Core Ultra 7 155H NPU 37.8×
Parakeet (PyTorch) Intel Arc 140V GPU 19.8×
eddy (OpenVINO) Intel Core Ultra 7 155H CPU 5-8×

Note: Benchmarked on HP EliteBook Ultra G1i. eddy NPU is 1.9× faster than PyTorch on Intel Arc GPU, with lower power consumption.

Usage

Python usage via ctypes available - see eddy repository for details.

Model Details

  • Parameters: 600M
  • Architecture: FastConformer-RNNT (4-model pipeline)
  • Language: English only
  • Blank token ID: 1024
  • Context window: 10s chunks with 3s overlap
  • Features: LSTM state continuity, token deduplication, per-token timestamps

License

CC-BY-4.0 - See LICENSE for details.

Links

Acknowledgments

Based on NVIDIA's Parakeet TDT model. OpenVINO conversion and optimization by the FluidInference team.

Downloads last month
8
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for FluidInference/parakeet-tdt-0.6b-v2-ov

Finetuned
(16)
this model

Collection including FluidInference/parakeet-tdt-0.6b-v2-ov