Title
Unsupervised learning of phonemes of whispered speech in a noisy environment based on convolutive non-negative matrix factorization
Abstract
This paper focuses on the development of an algorithm that can be optimized for a specific acoustic environment to improve the intelligibility of whispered speech. A new convolutive non-negative matrix factorization (NMF) algorithm is proposed to extract phoneme bases from noisy whispered speech with the noise bases from prior learning; these noise bases are obtained from training using the conventional non-negative matrix factorization. The divergence function with a sparseness constraint term is selected as the objective function in the developed algorithm to obtain multiplicative update rules of the phoneme base matrix and the corresponding weight matrix. The weights of the noise bases from prior learning are also updated in the phoneme learning stage. Listening experiments were conducted to assess the intelligibility performance of speech synthesized using the proposed algorithm. The experimental results indicate that the proposed algorithm is very effective for improving the intelligibility of whispers in various noise contexts, and it outperforms conventional algorithms.
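The two-stage scheme described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm. It makes several assumptions: single-frame bases (the convolutive model with a basis length of one frame), the generalized Kullback-Leibler divergence as "the divergence function", and an L1 penalty on the speech weights as the sparseness term. The function names, the `sparsity` parameter, and the iteration counts are hypothetical.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero and log(0)


def kl_divergence(V, L):
    """Generalized Kullback-Leibler divergence D(V || L)."""
    return float(np.sum(V * np.log((V + EPS) / (L + EPS)) - V + L))


def learn_noise_bases(V_noise, rank, n_iter=200, seed=0):
    """Stage 1: conventional KL-NMF on a noise-only magnitude spectrogram."""
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V_noise.shape
    W = rng.random((n_freq, rank)) + EPS
    H = rng.random((rank, n_frames)) + EPS
    ones = np.ones_like(V_noise)
    for _ in range(n_iter):
        R = V_noise / (W @ H + EPS)            # ratio of data to model
        H *= (W.T @ R) / (W.T @ ones + EPS)    # multiplicative weight update
        R = V_noise / (W @ H + EPS)
        W *= (R @ H.T) / (ones @ H.T + EPS)    # multiplicative basis update
    return W


def learn_speech_bases(V, W_noise, rank, sparsity=0.1, n_iter=200, seed=0):
    """Stage 2: learn speech bases from a noisy spectrogram.

    The noise bases W_noise stay fixed; their weights H_n are still updated.
    The L1 penalty (assumed form of the sparseness term) acts on H_s only.
    Note: W_s is not normalized here, so the penalty can be partly dodged by
    rescaling W_s and H_s; practical implementations usually normalize W_s.
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    W_s = rng.random((n_freq, rank)) + EPS
    H_s = rng.random((rank, n_frames)) + EPS
    H_n = rng.random((W_noise.shape[1], n_frames)) + EPS
    ones = np.ones_like(V)
    history = []
    for _ in range(n_iter):
        R = V / (W_s @ H_s + W_noise @ H_n + EPS)
        # The penalty weight enters the denominator of the speech-weight update.
        H_s *= (W_s.T @ R) / (W_s.T @ ones + sparsity + EPS)
        H_n *= (W_noise.T @ R) / (W_noise.T @ ones + EPS)
        R = V / (W_s @ H_s + W_noise @ H_n + EPS)
        W_s *= (R @ H_s.T) / (ones @ H_s.T + EPS)
        model = W_s @ H_s + W_noise @ H_n
        history.append(kl_divergence(V, model) + sparsity * float(H_s.sum()))
    return W_s, H_s, H_n, history
```

A caller would first run `learn_noise_bases` on a noise-only recording of the target environment, then pass the returned matrix to `learn_speech_bases` together with the noisy whisper spectrogram. The speech estimate is then `W_s @ H_s`. Extending the sketch to multi-frame (convolutive) bases would require summing over time-shifted copies of the weight matrix.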
Year: 2014
DOI: 10.1016/j.ins.2013.09.037
Venue: Inf. Sci.
Keywords: unsupervised learning, developed algorithm, noisy environment, various noise context, phoneme base matrix, conventional algorithm, noise base, conventional non-negative matrix factorization, new convolutive non-negative matrix, corresponding weight matrix, prior learning, proposed algorithm, non negative matrix factorization
DocType: Journal
Volume: 257
ISSN: 0020-0255
Citations: 4
PageRank: 0.42
References: 30
Authors: 5
Name | Order | Citations | PageRank
Jian Zhou | 1 | 7 | 2.50
Ruiyu Liang | 2 | 35 | 13.15
Li Zhao | 3 | 380 | 27.36
Liang Tao | 4 | 4 | 0.42
Cairong Zou | 5 | 415 | 27.19