Title
Multi-stream spectro-temporal and cepstral features based on data-driven hierarchical phoneme clusters
Abstract
We propose a method to enhance multi-stream Gabor and MFCC features using data-driven hierarchical phoneme clusters to yield more discriminating posteriors. We take into account different hierarchy structures, and in addition perform mean and variance normalization. A relative improvement of 11.5% over the conventional MFCC Tandem system was achieved in experiments conducted on Mandarin broadcast news. We analyze the complementarity between Gabor and MFCC features for different types of phonemes, and investigate the benefits that come from using hierarchical phoneme clusters.
Year
DOI
Venue
2011
10.1109/ICASSP.2011.5947528
ICASSP
Keywords
Field
DocType
lvcsr,speech recognition,multistream gabor features,multistream spectro-temporal features,data-driven hierarchical phoneme clusters,clustered hierarchical mlp,mfcc features,spectro-temporal features,automatic speech recognition
Complementarity (molecular biology),Mel-frequency cepstrum,Cluster (physics),Data-driven,Normalization (statistics),Pattern recognition,Computer science,Cepstrum,Speech recognition,Artificial intelligence,Mandarin Chinese
Conference
ISSN
ISBN
Citations 
1520-6149 E-ISBN : 978-1-4577-0537-3
978-1-4577-0537-3
1
PageRank 
References 
Authors
0.35
8
3
Name
Order
Citations
PageRank
Shangwen Li1326.54
Liang-Che Sun2363.43
Lin-shan Lee31525182.03