Title
DOA-guided source separation with direction-based initialization and time annotations using complex angular central Gaussian mixture models
Abstract
By means of spatial clustering and time-frequency masking, a mixture of multiple speakers and noise can be separated into the underlying signal components. The parameters of a model, such as a complex angular central Gaussian mixture model (cACGMM), can be determined based on the given signal mixture itself. Then, no misfit between training and testing conditions arises, as opposed to approaches that require labeled datasets to be trained. Whereas the separation can be performed in a completely unsupervised way, it may be beneficial to take advantage of a priori knowledge. The parameter estimation is sensitive to the initialization, and it is necessary to address the frequency permutation problem. In this paper, we therefore consider three techniques to overcome these limitations using direction of arrival (DOA) estimates. First, we propose an initialization with simple DOA-based masks. Second, we derive speaker-specific time annotations from the same masks in order to constrain the cACGMM. Third, we employ an approach where the mixture components are specific to each DOA instead of each speaker. We conduct experiments with sudden DOA changes, as well as a gradually moving speaker. The results demonstrate that the DOA-based initialization in particular is effective in overcoming both of the described limitations. In this case, even methods based on normally unavailable oracle information are not observed to be more beneficial to the permutation resolution or the initialization. Lastly, we also show that the proposed DOA-guided source separation works quite robustly in the presence of adverse conditions and realistic DOA estimation errors.
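The abstract does not say how the "simple DOA-based masks" are built, so the following is only an illustrative sketch of one common way to do it, not the authors' method. It assumes far-field propagation, a known 2-D microphone geometry, and one azimuth estimate per speaker. Each time-frequency bin is assigned to the DOA whose steering vector best matches the normalized observation. The names `steering_vectors`, `doa_based_masks`, `mic_xy`, and `azimuths` are hypothetical.

```python
import numpy as np

def steering_vectors(mic_xy, azimuth, freqs, c=343.0):
    """Far-field steering vectors of shape (F, M) for one azimuth in radians.

    Assumption: plane-wave model; mic_xy holds 2-D microphone positions in meters.
    """
    direction = np.array([np.cos(azimuth), np.sin(azimuth)])
    # Microphones closer to the source receive the wavefront earlier,
    # hence the negative relative delay (in seconds).
    delays = -(mic_xy @ direction) / c
    return np.exp(-2j * np.pi * freqs[:, None] * delays[None, :])

def doa_based_masks(Y, mic_xy, azimuths, freqs, c=343.0):
    """Binary masks of shape (K, F, T) from a multichannel STFT Y of shape (M, F, T)."""
    # Normalize each observation vector to unit length so that only the
    # inter-microphone pattern matters, not the signal power.
    norm = np.linalg.norm(Y, axis=0, keepdims=True)
    Z = Y / np.maximum(norm, 1e-12)
    scores = []
    for az in azimuths:
        d = steering_vectors(mic_xy, az, freqs, c)
        d = d / np.sqrt(d.shape[1])  # unit-norm steering vector per frequency
        # Similarity |d^H z| for every time-frequency bin, shape (F, T).
        scores.append(np.abs(np.einsum("fm,mft->ft", d.conj(), Z)))
    scores = np.stack(scores)            # (K, F, T)
    winner = np.argmax(scores, axis=0)   # index of best-matching DOA per bin
    k = np.arange(len(azimuths))[:, None, None]
    return (winner[None] == k).astype(float)
```

In a DOA-guided setup of the kind the abstract describes, masks like these could serve as initial class posteriors for the mixture model in place of a random initialization. Because each mask index is tied to the same direction at every frequency, such an initialization also fixes a consistent speaker order across frequencies. The hard argmax is a design simplification; a soft assignment would work equally well as a starting point.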
Year: 2022
DOI: 10.1186/s13636-022-00246-7
Venue: EURASIP Journal on Audio, Speech, and Music Processing
Keywords: Guided source separation, Spatial clustering, Direction of arrival, Time-frequency masks
DocType: Journal
Volume: 2022
Issue: 1
ISSN: 1687-4722
Citations: 0
PageRank: 0.34
References: 9
Authors: 4
Name                  Order  Citations  PageRank
Alexander Bohlender   1      0          1.35
Lucas Van Severen     2      0          0.34
Jonathan Sterckx      3      0          0.34
Nilesh Madhu          4      0          2.03