Unsupervised Speaker Adaptation Using All-Phoneme Ergodic Hidden Markov Network - Citegraph

Paper Info

Title
Unsupervised Speaker Adaptation Using All-Phoneme Ergodic Hidden Markov Network

Abstract
This paper proposes an unsupervised speaker adaptation method using an ''all-phoneme ergodic Hidden Markov Network'' that combines allophonic (context-dependent phone) acoustic models with stochastic language constraints. Hidden Markov Network (HMnet) for allophone modeling and allophonic bigram probabilities derived from a large text database are combined to yield a single large ergodic HMM which represents arbitrary speech signals in a particular language so that the model parameters can be re-estimated using text-unknown speech samples with the Baum-Welch algorithm. When combined with the Vector Field Smoothing (VFS) technique, unsupervised speaker adaptation can be effectively performed. This method experimentally gave better performances compared with our previous unsupervised adaptation method which used conventional phonetic HMMs and phoneme bigram probabilities especially when the amount of training data was small.

Year	Venue	Keywords
1995	IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS	SPEECH RECOGNITION, UNSUPERVISED SPEAKER ADAPTATION, ALL-PHONEME ERGODIC HIDDEN MARKOV NETWORK, CONTEXT-DEPENDENT PHONEME BIGRAM
Field	DocType	Volume
Pattern recognition,Computer science,Ergodic theory,Speech recognition,Speaker recognition,Natural language processing,Artificial intelligence,Speaker diarisation,Hidden Markov model,Speaker adaptation	Journal	E78D
Issue	ISSN	Citations
8	0916-8532	0
PageRank	References	Authors
0.34	0	4

Authors (4 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Yasunage Miyazawa	1	0	0.34
Jun-Ichi Takami	2	10	5.93
Shigeki Sagayama	3	1217	137.97
Shoichi Matsunaga	4	164	36.02

1