Abstract | ||
---|---|---|
The speech dialogue system has gradually been widely used in daily life. Users can consult and communicate with the system through natural language. However, in practical applications, third-person background sounds and background noise interference in real dialogue scenes will be encountered. The uncertainty and complexity of these background sounds will have a bad impact on the recognition of the system. A good speech enhancement module can help us to separate the target speaker from the original speech. Recently, a solution called SpEx+ was proposed from the time domain, but SpEx+ needs a reference speech to assist in training. This reference speech may have noise in actual applications that will affect performance. Therefore, we propose a Denoi-SpEx+ model. Before the reference speech is input to the network, a speech denoising network is added, so that the quality of speech separation in practical applications can be guaranteed. Experiments show that our model can significantly improve the performance of speech separation model of noisy reference speech. |
Year | DOI | Venue |
---|---|---|
2021 | 10.1109/ICEBE52470.2021.00030 | 2021 IEEE International Conference on e-Business Engineering (ICEBE) |
Keywords | DocType | ISBN |
speech dialogue system,speech separation,Denoi-SpEx+ | Conference | 978-1-6654-4419-4 |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yun Hao | 1 | 0 | 1.35 |
Xiangkang Huang | 2 | 1 | 1.36 |
Huichou Huang | 3 | 1 | 2.04 |
Wu Qingyao | 4 | 259 | 33.46 |