Role play dialogue topic model for language model adaptation in multi-party conversation speech recognition - Citegraph

Paper Info

Title
Role play dialogue topic model for language model adaptation in multi-party conversation speech recognition

Abstract
This paper introduces an unsupervised language model adaptation technique for multi-party conversation speech recognition. The use of topic models provides one of the most accurate frameworks for unsupervised language model adaptation since they can inject long-range topic information into language models. However, conventional topic models are not suitable for multi-party conversation because they assume that each speech set has each different topic. In a multi-party conversation, each speaker will share the same conversation topic and each speaker utterance will depend on both topic and speaker role. Accordingly, this paper proposes new concept of the “role play dialogue topic model” to utilize multiparty conversation attributes. The proposed topic model can share the topic distribution among each speaker and can also consider both topic and speaker role. The proposed topic model based adaptation realizes a new framework that sets multiple recognition hypotheses for each speaker and simultaneously adapts a language model for each speaker role. We use a call center dialogue data set in speech recognition experiments to show the effectiveness of the proposed method.

Year	DOI	Venue
2014	10.1109/ICASSP.2014.6854528	Acoustics, Speech and Signal Processing
Keywords	Field	DocType
natural language processing,speech recognition,unsupervised learning,call center dialogue data set,long-range topic information,multiparty conversation speech recognition,multiple recognition hypotheses,role play dialogue topic model,speaker role,speaker utterance,speech set,topic distribution,unsupervised language model adaptation technique,Unsupervised language model adaptation,multi-party conversation speech recognition,topic model	Conversation,Computer science,Utterance,Speech recognition,Speaker recognition,Artificial intelligence,Natural language processing,Speaker diarisation,Topic model,Language model	Conference
ISSN	Citations	PageRank
1520-6149	0	0.34
References	Authors
15	4

Authors (4 rows)

Cited by (0 rows)

References (15 rows)

Name	Order	Citations	PageRank
Ryo Masumura	1	25	28.24
Takanobu Oba	2	53	12.09
Hirokazu Masataki	3	18	9.21
Osamu Yoshioka	4	29	5.66

1