Abstract | ||
---|---|---|
A new speech enhancement strategy is proposed by utilizing a Bayesian nonparametric method of beta process factor analysis. As a sparse representation frame work, the dictionary learning, sparse coefficients representation and noise variance estimation are integrated into a joint procedure of Bayesian posterior estimation. The beta process is adopted as a sparse prior to infer the sparsity of the signal, and the appropriate dictionary size can be inferred nonparametrically. The dictionary training process could be performed directly on the noisy speech in situ without knowing the noise variance. Experiments were executed on noisy utterances from NOIZEUS database with SNR range from 0dB to 15dB of three types of noise, like white, train and street. And the subjective and objective measures like PESQ score and the output SegSNR are implemented to evaluate the performance of the proposed method and the other state-of-the-art methods. The corresponding results show that this proposed speech enhancement method achieves better performance both in stationary and non-stationary noise conditions. |
Year | DOI | Venue |
---|---|---|
2016 | 10.1109/ISCSLP.2016.7918427 | 2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) |
Keywords | Field | DocType |
speech enhancement,sparse representation,Bayesian nonparametric estimation,dictionary learning,beta process | Speech enhancement,Dictionary learning,Pattern recognition,Noise measurement,Computer science,Signal-to-noise ratio,Sparse approximation,Speech recognition,Nonparametric statistics,Artificial intelligence,Bayesian probability,PESQ | Conference |
ISBN | Citations | PageRank |
978-1-5090-4295-1 | 0 | 0.34 |
References | Authors | |
0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Lin Li | 1 | 323 | 79.92 |
Jiawen Wu | 2 | 3 | 2.74 |
Xinghao Ding | 3 | 591 | 52.95 |
Q. Y. Hong | 4 | 50 | 15.79 |
Delu Zeng | 5 | 164 | 11.46 |