An Acoustic Segment Model Approach To Incorporating Temporal Information Into Speaker Modeling For Text-Independent Speaker Recognition - Citegraph

Paper Info

Title
An Acoustic Segment Model Approach To Incorporating Temporal Information Into Speaker Modeling For Text-Independent Speaker Recognition

Abstract
We propose an acoustic segment model (ASM) approach to incorporating temporal information into speaker modeling in text-independent speaker recognition. In training, the proposed framework first estimates a collection of ASM-based universal background models (UBMs). Multiple sets of speaker-specific ASMs are then obtained by adapting the ASM-based UBMs with speaker-specific enrollment data. A novel usage of language models of the ASM units is also proposed to characterize transitions among ASMs. In the testing phase the ASM sets for the claimed speaker and UBMs, along with a bigram ASM language model, are used to calculate detection scores for each given test utterance. We report on speaker recognition experiments using the NIST 2001 SRE database. The results clearly indicate that the proposed ASM-based method achieves a notable improvement over the GMM-based speaker modeling in which no temporal modeling is considered. Moreover, a further error reduction is obtained by integrating the language model, another inclusion of temporal properties made possibly by ASM based speaker modeling.

Year	DOI	Venue
2010	10.1109/ICASSP.2010.5495617	2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING
Keywords	Field	DocType
Speaker recognition, acoustic segment model	Pattern recognition,Computer science,Utterance,Speech recognition,Speaker recognition,NIST,Artificial intelligence,Bigram,Temporal modeling,Speaker diarisation,Hidden Markov model,Language model	Conference
ISSN	Citations	PageRank
1520-6149	2	0.38
References	Authors
14	4

Authors (4 rows)

Cited by (2 rows)

References (14 rows)

Name	Order	Citations	PageRank
Yu Tsao	1	60	16.52
Hanwu Sun	2	98	14.15
Haizhou Li	3	3678	334.61
Chin-Hui Lee	4	6101	852.71

1