Instructions to use eborges78/supertonic-v1-coreml with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Supertonic
How to use eborges78/supertonic-v1-coreml with Supertonic:
from supertonic import TTS tts = TTS(auto_download=True) style = tts.get_voice_style(voice_name="M1") text = "The train delay was announced at 4:45 PM on Wed, Apr 3, 2024 due to track maintenance." wav, duration = tts.synthesize(text, voice_style=style) tts.save_audio(wav, "output.wav")
- Notebooks
- Google Colab
- Kaggle
Supertonic v1 Core ML
Core ML ML Program exports of Supertonic v1 English for iOS and macOS.
This repository hosts the heavy binary artifacts for the
eborges78/supertonic-1-coreml
GitHub project. The GitHub repo holds the code, conversion scripts, runtime
contract, Swift package, tests, and CI. This Hugging Face repo holds the
weights.
What Is Included
Core ML packages (fixed-shape, ML Program, iOS 16+):
models/supertonic-v1/coreml/duration_predictor_mlprogram.mlpackagemodels/supertonic-v1/coreml/text_encoder_mlprogram.mlpackagemodels/supertonic-v1/coreml/vector_estimator_mlprogram.mlpackagemodels/supertonic-v1/coreml/vocoder_mlprogram.mlpackage
Source ONNX models (for reconverting or inspection):
resources/supertonic-v1/onnx/duration_predictor.onnxresources/supertonic-v1/onnx/text_encoder.onnxresources/supertonic-v1/onnx/vector_estimator.onnxresources/supertonic-v1/onnx/vocoder.onnx
The small runtime files (voice styles, embedding tables, tts.json,
unicode_indexer.bin) live in the GitHub repository.
Export Configuration
format=mlprogram
minimum_deployment_target=iOS16
max_text_len=300
max_seconds=20
latent_length=288
sample_rate=44100
Quick Download
From Python, using the huggingface_hub CLI:
pip install -U huggingface_hub
huggingface-cli download eborges78/supertonic-v1-coreml \
--local-dir ./supertonic-v1-coreml
Or use the helper script in the GitHub repo:
git clone https://github.com/eborges78/supertonic-1-coreml.git
cd supertonic-1-coreml
python scripts/download_models.py
Runtime Contract
The four packages are fixed-shape and expect externalized text embeddings.
See docs/runtime-contract.md
in the GitHub repo for the exact tensor shapes and the integration steps.
A minimal Swift integration reference is shipped as a SwiftPM library:
Sources/SupertonicV1CoreML/SupertonicV1CoreMLPipeline.swift.
Provenance and License
- Upstream model:
Supertone/supertonic— OpenRAIL-M. - Upstream project:
supertone-inc/supertonic. - Core ML conversion reference:
Nooder/supertonic-2-coreml.
The weights in this repository are redistributed under the upstream OpenRAIL-M
license. Use-based restrictions from the upstream license are preserved. See
the GitHub repo's MODEL_LICENSE.md
for details.
The conversion scripts and sample integration code on GitHub are MIT-licensed.
Acknowledgements
- Supertone for releasing Supertonic v1.
- The Supertonic 2 Core ML community work that informed parts of this conversion approach.
- Downloads last month
- 26