Complex neural spatial filter: Enhancing multi-channel target speech separation in complex domain 阅读笔记
https://arxiv.org/abs/2104.12359
The existing models for estimating cRM are designed in the way that the real and imaginary parts of the cRM are separately modeled using real-valued training data pairs. The research motivation of this study is to design a deep model that fully exploits the temporal-spectral-spatial information of multi-channel signals for estimating cRM directly and efficiently in complex domain.
The idea is triggered by complex DNN models [DCCRN]: single-stream model instead of two-stream for real and imag parts is designed where all the network components and operations are in complex domain (such as Complex batch normalization).