TDANetBest-2ms-LRS2 / README.md
metadata
license: apache-2.0
language:
  - en
tags:
  - audio
  - audio-source-separation
pipeline_tag: audio-to-audio

An efficient encoder-decoder architecture with top-down attention for speech separation


This repository is the official implementation of "An efficient encoder-decoder architecture with top-down attention for speech separation" (ICLR 2023).

@inproceedings{tdanet2023iclr,
  title={An efficient encoder-decoder architecture with top-down attention for speech separation},
  author={Li, Kai and Yang, Runxuan and Hu, Xiaolin},
  booktitle={ICLR},
  year={2023}
}

Training Dataset

  • LRS2-2Mix

Config

    enc_kernel_size: 2
    in_channels: 512
    num_blocks: 16
    num_sources: 2
    out_channels: 128
    upsampling_depth: 5
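
The values above can be loaded into a small typed container before being handed to a model constructor. The following is a minimal sketch, not the repository's own loading code: `TDANetConfig`, `parse_config`, and `kernel_size_in_samples` are hypothetical names introduced for illustration. It also assumes that `enc_kernel_size` is given in milliseconds (as the "2ms" in the model name suggests) and that the audio is sampled at 16 kHz; neither is stated in this README.

```python
from dataclasses import dataclass

# The config values exactly as listed in this README.
CONFIG_TEXT = """
enc_kernel_size: 2
in_channels: 512
num_blocks: 16
num_sources: 2
out_channels: 128
upsampling_depth: 5
"""


@dataclass(frozen=True)
class TDANetConfig:
    """Hypothetical container for the hyperparameters listed above."""
    enc_kernel_size: int
    in_channels: int
    num_blocks: int
    num_sources: int
    out_channels: int
    upsampling_depth: int


def parse_config(text: str) -> TDANetConfig:
    """Parse flat `key: value` lines with integer values into a TDANetConfig."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, raw = line.partition(":")
        values[key.strip()] = int(raw.strip())
    return TDANetConfig(**values)


def kernel_size_in_samples(cfg: TDANetConfig, sample_rate_hz: int = 16_000) -> int:
    """Convert the encoder kernel size to samples.

    Assumption: enc_kernel_size is in milliseconds and the default
    sample rate is 16 kHz; check both against the training setup.
    """
    return cfg.enc_kernel_size * sample_rate_hz // 1000


config = parse_config(CONFIG_TEXT)
print(config)
print(kernel_size_in_samples(config))
```

Under those assumptions, a 2 ms kernel corresponds to 32 samples at 16 kHz. If the model was trained at a different sample rate, pass it explicitly as `sample_rate_hz`.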