Cocktail-Fork MRX (MLX)
Collection
MERL MRX ported to Apple MLX: 3-stem music/speech/sfx soundtrack separation. Numerically exact vs PyTorch. 4 variants.
How to use mlx-community/Cocktail-Fork-MRX-adapted-loudness with MLX:
# Download the model from the Hub
# (the extra is quoted so that zsh does not treat [hf_xet] as a glob pattern)
pip install "huggingface_hub[hf_xet]"
huggingface-cli download --local-dir Cocktail-Fork-MRX-adapted-loudness mlx-community/Cocktail-Fork-MRX-adapted-loudness
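The same download can also be scripted from Python. This is a minimal sketch, assuming the `huggingface_hub` package is installed; `snapshot_download` is that library's function, while the helper name `download_weights` is hypothetical and only for illustration.

```python
from pathlib import Path

REPO_ID = "mlx-community/Cocktail-Fork-MRX-adapted-loudness"


def download_weights(local_dir: str = "Cocktail-Fork-MRX-adapted-loudness") -> Path:
    """Fetch every file in the model repo into local_dir and return that path.

    Hypothetical helper: needs network access and `pip install huggingface_hub`.
    """
    # Imported lazily so this module still loads when huggingface_hub is absent.
    from huggingface_hub import snapshot_download

    return Path(snapshot_download(repo_id=REPO_ID, local_dir=local_dir))
```

Calling `download_weights()` should leave the checkpoint files in the same folder the CLI command writes to.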
adapted-loudness variant (MLX)
Apple MLX port of MERL's MRX (Multi-Resolution CrossNet) — separates a soundtrack mixture into music, speech, and sound effects (sfx).
This variant uses the adapted_loudness_ checkpoint, which was adapted with
loudness normalization for better alignment with real cinematic/movie stems.
Try this (and -adapted-eq) on real soundtrack content.
Other variants: Cocktail-Fork-MRX (default) ·
-paper (ICASSP reproduction) ·
-adapted-eq.
6e-7).
pip install git+https://github.com/xocialize/cocktail-fork-mlx
cocktail-fork-mlx --audio-path soundtrack.wav --out-dir ./out \
--weights mlx-community/Cocktail-Fork-MRX-adapted-loudness
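To process several files, the CLI call can be wrapped in a small script. The sketch below assumes only the three flags shown in the command (`--audio-path`, `--out-dir`, `--weights`); the helper names `build_command` and `separate_folder` and the one-subfolder-per-file layout are hypothetical choices, not part of the package.

```python
import subprocess
from pathlib import Path
from typing import List, Union

WEIGHTS = "mlx-community/Cocktail-Fork-MRX-adapted-loudness"


def build_command(
    audio_path: Union[str, Path],
    out_dir: Union[str, Path],
    weights: str = WEIGHTS,
) -> List[str]:
    """Return the argv list for one cocktail-fork-mlx run."""
    return [
        "cocktail-fork-mlx",
        "--audio-path", str(audio_path),
        "--out-dir", str(out_dir),
        "--weights", weights,
    ]


def separate_folder(in_dir: Union[str, Path], out_root: Union[str, Path]) -> None:
    """Run the CLI once per .wav file, writing each result to its own subfolder."""
    for wav in sorted(Path(in_dir).glob("*.wav")):
        out_dir = Path(out_root) / wav.stem  # hypothetical layout: out_root/<file stem>/
        out_dir.mkdir(parents=True, exist_ok=True)
        # check=True raises CalledProcessError if the CLI exits with a non-zero status.
        subprocess.run(build_command(wav, out_dir), check=True)
```

Passing the arguments as a list rather than one shell string avoids quoting problems with file names that contain spaces.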
~30.6M params, fp32 (122 MB), 44.1 kHz. MIT, © MERL for the original model/weights.
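The 122 MB figure follows from the parameter count, since each fp32 weight takes 4 bytes. A quick check, assuming decimal megabytes (1 MB = 10^6 bytes) and treating the rounded 30.6M count as an approximation:

```python
PARAMS = 30.6e6        # approximate parameter count quoted above
BYTES_PER_FP32 = 4     # one 32-bit float occupies 4 bytes


def checkpoint_size_mb(n_params: float, bytes_per_param: int = BYTES_PER_FP32) -> float:
    """Estimate raw weight storage in decimal megabytes (1 MB = 10**6 bytes)."""
    return n_params * bytes_per_param / 1e6


size_mb = checkpoint_size_mb(PARAMS)
print(f"{size_mb:.1f} MB")  # prints "122.4 MB"
```

The estimate covers the weights only; file-format metadata adds a small amount on top.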