Monaural Source Separation: From Anechoic To Reverberant Environments - Citegraph

Paper Info

Title
Monaural Source Separation: From Anechoic To Reverberant Environments

Abstract
Impressive progress in neural network-based single-channel speech source separation has been made in recent years. But those improvements have been mostly reported on anechoic data, a situation that is hardly met in practice. Taking the SepFormer as a starting point, which achieves state-of-the-art performance on anechoic mixtures, we gradually modify it to optimize its performance on reverberant mixtures. Although this leads to a word error rate improvement by 7 percentage points compared to the standard SepFormer implementation, the system ends up with only marginally better performance than a PIT-BLSTM separation system, that is optimized with rather straightforward means. This is surprising and at the same time sobering, challenging the practical usefulness of many improvements reported in recent years for monaural source separation on nonrever-berant data.

Year	DOI	Venue
2022	10.1109/IWAENC53105.2022.9914794	2022 International Workshop on Acoustic Signal Enhancement (IWAENC)
Keywords	DocType	ISBN
speech separation,deep learning,SepFormer,automatic speech recognition,reverberation	Conference	978-1-6654-6868-8
Citations	PageRank	References
0	0.34	7
Authors
6

Authors (6 rows)

Cited by (0 rows)

References (7 rows)

Name	Order	Citations	PageRank
Tobias Cord-Landwehr	1	0	0.68
Boeddeker Christoph	2	3	3.84
Thilo von Neumann	3	6	2.57
Catalin Zorila	4	2	2.74
Rama Doddipatla	5	2	4.09
Reinhold Haeb-Umbach	6	1487	211.71

1