Title
Multi-view learning with supervision for transformed bottleneck features
Abstract
Previous work has shown that acoustic features can be improved by unsupervised learning of transformations based on canonical correlation analysis (CCA) using articulatory measurements that are available at training time. In this paper, we investigate whether this second view (articulatory data) still helps even when labels are also available at training time. We begin with strong baseline bottleneck features, which can be learned when the training set is phonetically labeled. We then compare several options for learning transformations of the bottleneck features in the presence of both articulatory measurements and phonetic labels for the training data. The methods compared include combinations of LDA and CCA, as well as a three-view extension of CCA that simultaneously uses the labels and articulatory measurements as additional views. Phonetic recognition experiments on data from the University of Wisconsin X-ray microbeam database show that the learned features improve performance over using either just the labels or just the articulatory measurements for learning acoustic transformations.
Year
DOI
Venue
2014
10.1109/ICASSP.2014.6854050
Acoustics, Speech and Signal Processing
Keywords
Field
DocType
acoustic signal processing,correlation theory,speech recognition,unsupervised learning,CCA,LDA,Wisconsin University,X-ray microbeam database,acoustic features,acoustic transformations,articulatory data,articulatory measurements,baseline bottleneck features,canonical correlation analysis,linear discriminant analysis,multiview learning,phonetic labels,phonetic recognition,supervision,three-view extension,training time,transformed bottleneck features,unsupervised learning,articulatory measurements,bottleneck features,canonical correlation analysis,multi-view learning,supervised transformation learning
Training set,Bottleneck,Pattern recognition,Computer science,Canonical correlation,Speech recognition,Unsupervised learning,Artificial intelligence
Conference
ISSN
Citations 
PageRank 
1520-6149
12
0.60
References 
Authors
21
2
Name
Order
Citations
PageRank
R. Arora148935.97
Karen Livescu2125471.43