vconnx · RVC base models (pure ONNX)

Shared base components for RVC (MIT) inference in the vconnx voice-conversion library: a ContentVec-768 content encoder and an RMVPE pitch estimator. RVC is any-to-one: the target speaker lives in a separate per-voice net_g model (thousands of community-trained voices exist on Hugging Face), while these two base models are voice-independent.

| File | Role | Size |
|---|---|---|
| contentvec_768l12.onnx | content encoder (fp32) | 360 MB |
| contentvec_768l12_q8.onnx | content encoder (int8) | 91 MB |
| rmvpe.onnx | pitch estimator (fp32) | 345 MB |
| rmvpe_q8.onnx | pitch estimator (int8) | 94 MB |

ContentVec export parity against the PyTorch reference: maximum absolute difference (max_abs) of 8.5e-06. Provenance is documented in PROVENANCE.md.
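For readers unfamiliar with the max_abs metric, here is a minimal sketch of how such a parity figure is typically computed: take the largest element-wise absolute difference between the reference output and the exported model's output. The function names, the toy values, and the 1e-5 tolerance are illustrative assumptions. This is not the script used to produce the number above.

```python
from typing import Sequence


def max_abs_diff(reference: Sequence[float], candidate: Sequence[float]) -> float:
    """Return the largest element-wise absolute difference between two flat outputs."""
    if len(reference) != len(candidate):
        raise ValueError("outputs must have the same number of elements")
    return max((abs(r - c) for r, c in zip(reference, candidate)), default=0.0)


def within_parity(
    reference: Sequence[float],
    candidate: Sequence[float],
    tolerance: float = 1e-5,  # assumed threshold, chosen only for illustration
) -> bool:
    """Check that the export stays within the given tolerance of the reference."""
    return max_abs_diff(reference, candidate) <= tolerance


# Toy values standing in for flattened torch and ONNX encoder outputs.
torch_out = [0.125, -0.5, 0.75]
onnx_out = [0.125, -0.5, 0.75 + 2 ** -20]

print(max_abs_diff(torch_out, onnx_out))
print(within_parity(torch_out, onnx_out))
```

In practice the two sequences would be the flattened feature tensors produced by the original PyTorch encoder and by the ONNX export for the same input audio.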

Usage

from vconnx import VoiceCloner
# reference_voice = an RVC voice MODEL (.onnx path or HF id), not audio
out = VoiceCloner(engine="rvc").clone_voice("source.wav", "owner/repo::voice.onnx")
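The reference string in the example reads as a Hugging Face repository id and a filename joined by `::`. The sketch below shows one way such a string could be split, assuming that reading is correct. `VoiceRef` and `parse_voice_ref` are hypothetical names and are not part of the vconnx API; the library's actual parsing may differ.

```python
from typing import NamedTuple, Optional


class VoiceRef(NamedTuple):
    """Hypothetical container for a parsed voice-model reference."""
    repo_id: Optional[str]  # None when the reference is a plain local path
    filename: str


def parse_voice_ref(ref: str) -> VoiceRef:
    """Split 'owner/repo::file.onnx' into repo id and filename (assumed format)."""
    repo_id, sep, filename = ref.partition("::")
    if not sep:
        # No '::' separator: treat the whole string as a local .onnx path.
        return VoiceRef(None, ref)
    if not repo_id or not filename:
        raise ValueError(f"malformed voice reference: {ref!r}")
    return VoiceRef(repo_id, filename)


print(parse_voice_ref("owner/repo::voice.onnx"))
print(parse_voice_ref("voices/local.onnx"))
```

Either form points at a per-voice RVC model, never at an audio recording of the target speaker.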