Title | ||
---|---|---|
Multi-stream spectro-temporal and cepstral features based on data-driven hierarchical phoneme clusters |
Abstract | ||
---|---|---|
We propose a method to enhance multi-stream Gabor and MFCC features using data-driven hierarchical phoneme clusters to yield more discriminating posteriors. We take into account different hierarchy structures, and in addition perform mean and variance normalization. A relative improvement of 11.5% over the conventional MFCC Tandem system was achieved in experiments conducted on Mandarin broadcast news. We analyze the complementarity between Gabor and MFCC features for different types of phonemes, and investigate the benefits that come from using hierarchical phoneme clusters. |
Year | DOI | Venue |
---|---|---|
2011 | 10.1109/ICASSP.2011.5947528 | ICASSP |
Keywords | Field | DocType |
lvcsr,speech recognition,multistream gabor features,multistream spectro-temporal features,data-driven hierarchical phoneme clusters,clustered hierarchical mlp,mfcc features,spectro-temporal features,automatic speech recognition | Complementarity (molecular biology),Mel-frequency cepstrum,Cluster (physics),Data-driven,Normalization (statistics),Pattern recognition,Computer science,Cepstrum,Speech recognition,Artificial intelligence,Mandarin Chinese | Conference |
ISSN | ISBN | Citations |
1520-6149 E-ISBN : 978-1-4577-0537-3 | 978-1-4577-0537-3 | 1 |
PageRank | References | Authors |
0.35 | 8 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Shangwen Li | 1 | 32 | 6.54 |
Liang-Che Sun | 2 | 36 | 3.43 |
Lin-shan Lee | 3 | 1525 | 182.03 |