Title
A stereophonic acoustic signal extraction scheme for noisy and reverberant environments
Abstract
In this contribution, a novel two-channel acoustic front-end for robust automatic speech recognition in adverse acoustic environments with nonstationary interference and reverberation is proposed. From a MISO system perspective, a statistically optimum source signal extraction scheme based on the multichannel Wiener filter (MWF) is discussed for application in noisy and underdetermined scenarios. For free-field and diffuse noise conditions, this optimum scheme reduces to a Delay & Sum beamformer followed by a single-channel Wiener postfilter. Scenarios with multiple simultaneously interfering sources and background noise are usually modeled by a diffuse noise field. However, in reality, the free-field assumption is very weak because of the reverberant nature of acoustic environments. Therefore, we propose to estimate this simplified MWF solution in each frequency bin separately to cope with reverberation. We show that this approach can very efficiently be realized by the combination of a blocking matrix based on semi-blind source separation ('directional BSS'), which provides a continuously updated reference of all undesired noise and interference components separated from the desired source and its reflections, and a single-channel Wiener postfilter. Moreover, it is shown, how the obtained reference signal of all undesired components can efficiently be used to realize the Wiener postfilter, and at the same time, generalizes well-known postfilter realizations. The proposed front-end and its integration into an automatic speech recognition (ASR) system are analyzed and evaluated in noisy living-room-like environments according to the PASCAL CHiME challenge. A comparison to a simplified front-end based on a free-field assumption shows that the introduced system substantially improves the speech quality and the recognition performance under the considered adverse conditions.
Year
DOI
Venue
2013
10.1016/j.csl.2012.07.011
Computer Speech & Language
Keywords
Field
DocType
reverberant environment,generalizes well-known postfilter realization,diffuse noise field,free-field assumption,wiener postfilter,acoustic environment,undesired noise,diffuse noise condition,single-channel wiener postfilter,background noise,stereophonic acoustic signal extraction,miso system perspective
Wiener filter,Speech enhancement,Reverberation,Background noise,Underdetermined system,Computer science,Stereophonic sound,Speech recognition,Interference (wave propagation),Source separation
Journal
Volume
Issue
ISSN
27
3
0885-2308
Citations 
PageRank 
References 
19
0.79
17
Authors
7
Name
Order
Citations
PageRank
Klaus Reindl1504.50
Yuanhang Zheng2584.24
Andreas Schwarz3292.73
Stefan Meier4334.12
Roland Maas520617.08
Armin Sehr617910.82
Walter Kellermann753545.32