Cocktail-Fork-MRX — `adapted-loudness` variant (MLX)

Apple MLX port of MERL's MRX (Multi-Resolution CrossNet) — separates a soundtrack mixture into music, speech, and sound effects (sfx).

This variant uses the adapted_loudness_ checkpoint, which was adapted with loudness normalization to align better with real cinematic/movie stems. Try it (and the -adapted-eq variant) on real soundtrack content.

Other variants: Cocktail-Fork-MRX (default) · -paper (ICASSP reproduction) · -adapted-eq.

Upstream: merlresearch/cocktail-fork-separation (MIT) · The Cocktail Fork Problem, ICASSP 2022.
Parity: matches the PyTorch reference to within fp32 rounding error (full-forward max_abs ≈ 6e-7).
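The max_abs figure is read here as the largest element-wise absolute difference between the two frameworks' outputs for the same input. The sketch below shows that metric on synthetic arrays. The helper name `max_abs_diff`, the array shape, and the noise level are illustrative assumptions, not taken from the port's test suite.

```python
import numpy as np


def max_abs_diff(reference: np.ndarray, candidate: np.ndarray) -> float:
    """Largest element-wise absolute difference between two same-shaped arrays."""
    if reference.shape != candidate.shape:
        raise ValueError(f"shape mismatch: {reference.shape} vs {candidate.shape}")
    # Subtract in float64 so the comparison itself adds no fp32 rounding.
    delta = reference.astype(np.float64) - candidate.astype(np.float64)
    return float(np.max(np.abs(delta)))


# Synthetic stand-ins for the two outputs (assumed layout: stems x channels x samples).
rng = np.random.default_rng(0)
reference = rng.standard_normal((3, 2, 44100)).astype(np.float32)
noise = rng.uniform(-5e-7, 5e-7, size=reference.shape).astype(np.float32)
candidate = reference + noise  # simulates small fp32-level disagreement

diff = max_abs_diff(reference, candidate)
print(f"max_abs = {diff:.1e}")
```

To check a real run, replace the synthetic arrays with the PyTorch and MLX outputs converted to NumPy.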

Usage

pip install git+https://github.com/xocialize/cocktail-fork-mlx
cocktail-fork-mlx --audio-path soundtrack.wav --out-dir ./out \
    --weights mlx-community/Cocktail-Fork-MRX-adapted-loudness
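To separate several files, the same command can be assembled once per input. This is a minimal sketch that uses only the three flags shown above. The helper `build_command`, the input file names, and the one-output-folder-per-file layout are assumptions made for illustration.

```python
import shlex
from pathlib import Path

WEIGHTS = "mlx-community/Cocktail-Fork-MRX-adapted-loudness"


def build_command(audio_path: Path, out_root: Path, weights: str = WEIGHTS) -> list[str]:
    """Build the CLI argument list for one input file (hypothetical helper)."""
    # Assumption: each input gets its own output folder named after the file stem.
    out_dir = out_root / audio_path.stem
    return [
        "cocktail-fork-mlx",
        "--audio-path", str(audio_path),
        "--out-dir", str(out_dir),
        "--weights", weights,
    ]


# Hypothetical input files.
inputs = [Path("reel1.wav"), Path("reel2.wav")]
commands = [build_command(path, Path("out")) for path in inputs]

for cmd in commands:
    # Print the shell-quoted command; pass cmd to subprocess.run(cmd, check=True) to execute it.
    print(shlex.join(cmd))
```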

~30.6M params, fp32 (122 MB), 44.1 kHz. MIT, © MERL for the original model/weights.
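The 122 MB figure follows from the parameter count: fp32 stores each parameter in 4 bytes. The short check below assumes decimal megabytes (1 MB = 10^6 bytes) and ignores any file-format overhead.

```python
params = 30.6e6        # approximate parameter count stated above
bytes_per_param = 4    # fp32 uses 4 bytes per value

size_mb = params * bytes_per_param / 1e6  # decimal megabytes
print(f"{size_mb:.1f} MB")
```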
