Title
Recursive likelihood evaluation and fast search algorithm for polynomial segment model with application to speech recognition
Abstract
Polynomial segment models (PSMs), which generalize hidden Markov models (HMMs), have opened an alternative research direction for speech recognition. However, they have been limited by their computational complexity. Traditionally, any change in a PSM segment boundary requires recomputing the likelihood of all the frames within the segment. This makes PSM segment likelihood evaluation an order of magnitude more expensive than the HMM's. Furthermore, because recognition using segment models needs to search over all possible segment boundaries, PSM recognition is computationally infeasible beyond N-best rescoring. By exploiting the properties of time normalization in the PSM, and by decomposing the PSM segment likelihood into a simple function of "sufficient statistics", we show that the segment likelihood can be evaluated efficiently, with computational complexity of an order similar to that of the HMM. In addition, by reformulating PSM recognition as a search for the optimal path through a graph, this paper introduces a fast PSM search algorithm that intelligently prunes the number of hypothesized segment boundaries, so that PSM recognition can be performed with complexity of an order similar to that of the HMM. We demonstrate the effectiveness of the proposed algorithms with experiments using a PSM-based recognition system on two different recognition tasks: TIDIGIT digit recognition and the Wall Street Journal dictation task. In both tasks, PSM recognition is feasible and outperforms the traditional HMM by more than 14%.
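The idea of evaluating a segment likelihood from "sufficient statistics" rather than from every frame can be illustrated with a minimal sketch. It is not taken from the paper: it assumes a scalar feature, a fixed-variance Gaussian, and a polynomial mean in normalized time tau in [0, 1], and every name in it (prefix_stats, tau_power_sums, segment_loglik, segment_loglik_direct) is hypothetical. Prefix sums over the utterance are accumulated once, after which the likelihood of any candidate segment is computed in time that depends on the polynomial order rather than on the segment length.

```python
from functools import lru_cache
from math import comb, log, pi

import numpy as np


def prefix_stats(y, order):
    """Prefix sums of t**i * y[t] (i = 0..order) and of y[t]**2 over the utterance."""
    t = np.arange(len(y), dtype=float)
    moments = np.array(
        [np.concatenate(([0.0], np.cumsum(t**i * y))) for i in range(order + 1)]
    )
    energy = np.concatenate(([0.0], np.cumsum(y * y)))
    return moments, energy


@lru_cache(maxsize=None)
def tau_power_sums(length, max_power):
    """Sums of tau**m over a segment; data-independent, so cached per length."""
    tau = np.arange(length) / max(length - 1, 1)
    return tuple(float(np.sum(tau**m)) for m in range(max_power + 1))


def segment_loglik(moments, energy, start, end, coeffs, var):
    """Log-likelihood of frames start..end-1 using only the prefix statistics."""
    coeffs = np.asarray(coeffs, dtype=float)
    order = len(coeffs) - 1
    length = end - start
    scale = max(length - 1, 1)
    # cross[j] = sum over frames of tau**j * y, with tau = (t - start) / scale.
    # (t - start)**j is expanded binomially so that sums of t**i * y can be reused.
    cross = np.zeros(order + 1)
    for j in range(order + 1):
        shifted = sum(
            comb(j, i) * (-start) ** (j - i) * (moments[i, end] - moments[i, start])
            for i in range(j + 1)
        )
        cross[j] = shifted / scale**j
    sums = tau_power_sums(length, 2 * order)
    gram = np.array([[sums[j + k] for k in range(order + 1)] for j in range(order + 1)])
    # Squared error expanded as sum(y^2) - 2 b.cross + b.gram.b
    residual = (energy[end] - energy[start]) - 2.0 * coeffs @ cross + coeffs @ gram @ coeffs
    return -0.5 * length * log(2.0 * pi * var) - residual / (2.0 * var)


def segment_loglik_direct(y, start, end, coeffs, var):
    """Reference version that revisits every frame in the segment."""
    length = end - start
    tau = np.arange(length) / max(length - 1, 1)
    mean = np.polynomial.polynomial.polyval(tau, coeffs)
    resid = y[start:end] - mean
    return -0.5 * length * log(2.0 * pi * var) - float(resid @ resid) / (2.0 * var)
```

Moving a segment boundary in this sketch only changes the indices used to read the prefix sums, which is the kind of saving the abstract describes. Powers of absolute frame indices can lose precision on long utterances, so a practical system would likely need a more careful formulation.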
Year: 2006
DOI: 10.1109/TSA.2005.858553
Venue: IEEE Transactions on Audio, Speech & Language Processing
Keywords: psm segment boundary,psm recognition,recursive likelihood evaluation,speech recognition,tidigit digit recognition,wall street journal dictation task,time normalization,psm-based recognition system,possible segment boundary,fast search algorithm,search problems,computational complexity,fast algorithm,polynomial segment model,different recognition task,segment likelihood,hidden markov models,fast psm search algorithm,psm segment likelihood,search,search algorithm,hidden markov model
Field: Speech processing,Normalization (statistics),Search algorithm,Pattern recognition,Polynomial,Segmentation,Computer science,Speech recognition,Artificial intelligence,Hidden Markov model,Recursion,Computational complexity theory
DocType: Journal
Volume: 14
Issue: 5
ISSN: 1558-7916
Citations: 6
PageRank: 0.51
References: 24
Authors: 3
Name                    Order  Citations  PageRank
Chak-Fai LI             1      20         1.68
Manhung Siu             2      464        61.40
Jeff Siu-Kei Au-Yeung   3      64         7.75