File size: 920 Bytes
4a26e53 41a0ab2 a6ac2f5 41a0ab2 4a26e53 41a0ab2 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 |
---
language:
- en
license: apache-2.0
tags:
- hearing loss
- challenge
- signal processing
- source separation
- audio
- audio-to-audio
- Causal
---
# Cadenza Challenge: CAD2-Task1
A Causal Sax/Others separation model for the CAD2-Task2 baseline system.
* Architecture: ConvTasNet (Kaituo XU) with multichannel support (Alexandre Defossez).
* Parameters:
* B: 256
* C: 2
* H: 512
* L: 20
* N: 256
* P: 3
* R: 3
* X: 8
* audio_channels: 2
* causal: true
* mask_nonlinear: relu
* norm_type: cLN
* training:
* sample_rate: 44100
* samples_per_track: 64
* segment: 5.0
* aggregate: 2
* batch_size: 4
* early_stop: true
* epochs: 200
## Dataset
The model was trained using EnsembleSet and CadenzaWoodwind datasets.
## How to use
```
from tasnet import ConvTasNetStereo
model = ConvTasNetStereo.from_pretrained(
"cadenzachallenge/ConvTasNet_Sax_Causal"
).cpu()
```
|