NVMOS SPEAR-L9 Scorer

This repository hosts the released NVMOS downstream scorer checkpoint for non-verbal vocalization quality assessment.

Files:

  • nvmos_spear_l9.pt: PyTorch state dict for the text-query cross-attention scorer.
  • config.json: inference configuration, including upstream encoder model IDs and scorer dimensions.
  • training_run_config.json: training-time configuration record.
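
Before running the full pipeline, it can help to inspect the downloaded files. The following is a minimal sketch, not the repository's inference code: it assumes the three files sit in the current directory, that config.json holds a JSON object, and that nvmos_spear_l9.pt is a flat state dict of tensors. The helper names load_config and summarize_state_dict are hypothetical and do not come from the NVMOS codebase.

```python
import json


def load_config(path):
    """Read a JSON configuration file and return the parsed object."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def summarize_state_dict(path):
    """Map each tensor entry in a saved state dict to its shape.

    Assumes the checkpoint is a flat mapping of parameter names to tensors.
    """
    import torch  # imported lazily so load_config works without PyTorch

    # weights_only=True restricts unpickling to tensors and plain containers.
    state = torch.load(path, map_location="cpu", weights_only=True)
    return {
        name: tuple(value.shape)
        for name, value in state.items()
        if hasattr(value, "shape")  # skip any non-tensor entries
    }


# Example usage (paths are assumptions about where the files were saved):
#   config = load_config("config.json")
#   print(sorted(config))  # top-level configuration keys
#   for name, shape in summarize_state_dict("nvmos_spear_l9.pt").items():
#       print(name, shape)
```

Listing parameter names and shapes this way is only a sanity check; building the scorer module and loading the weights into it still requires the model definition from the inference code.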

The full inference code is available at https://github.com/yongaifadian1/NVMOS.
