Sinusoidal masks for single channel speech separation

TitleSinusoidal masks for single channel speech separation
Publication TypeConference Paper
Year of Publication2010
AuthorsMowlaee, P., Christensen M. G., & Jensen S. H.
Conference NameAcoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Pages4262 -4265
Date Publishedmarch

In this paper we present a new approach for binary and soft masks used in single-channel speech separation. We present a novel approach called the sinusoidal mask (binary mask and Wiener filter) in a sinusoidal space. Theoretical analysis is presented for the proposed method, and we show that the proposed method is able to minimize the target speech distortion while suppressing the crosstalk to a predetermined threshold. It is observed that compared to the STFT-based masks, the proposed sinusoidal masks improve the separation performance in terms of objective measures (SSNR and PESQ) and are mostly preferred by listeners.


Citation Key5495679