Comparison of Tied-Mixture and State-Clustered HMMs with Respect to Recognition Performance and Training Method - Citegraph

Paper Info

Title
Comparison of Tied-Mixture and State-Clustered HMMs with Respect to Recognition Performance and Training Method

Abstract
Tied-mixture HMMs have been proposed as the acoustic model for large-vocabulary continuous speech recognition and have yielded promising results. They share base-distribution and provide more flexibility in choosing the degree of tying than state-clustered HMMs. However, it is unclear which acoustic models to superior to the other under the same training data. Moreover, LBG algorithm and EM algorithm, which are the usual training methods for HMMs, have not been compared. Therefore in this paper, the recognition performance of the respective HMMs and the respective training methods are compared under the same condition. It was found that the number of parameters and the word error rate for both HMMs are equivalent when the number of codebooks is sufficiently large. It was also found that training method using the LBG algorithm achieves a 90% reduction in training time compared to training method using the EM algorithm, without degradation of recognition accuracy.

Year	DOI	Venue
2014	10.4018/jitr.2014070102	Journal of Information Technology Research
Keywords	Field	DocType
state-clustered hmm,speech recognition,acoustic model,lbg algorithm,tied-mixture hmm,speech database,training method,em algorithm	Training set,Pattern recognition,Computer science,Expectation–maximization algorithm,Word error rate,Speech recognition,Artificial intelligence,Acoustic model	Journal
Volume	Issue	ISSN
7	3	1938-7857
Citations	PageRank	References
0	0.34	17
Authors
5

Authors (5 rows)

Cited by (0 rows)

References (17 rows)

Name	Order	Citations	PageRank
Hiroyuki Segi	1	12	2.78
Kazuo Onoe	2	33	6.40
Shoei Sato	3	31	7.99
Akio Kobayashi	4	34	8.35
A. Ando	5	55	12.03

1