Title
Prediction of Who Will Be the Next Speaker and When Using Gaze Behavior in Multiparty Meetings.
Abstract
In multiparty meetings, participants need to predict the end of the speaker’s utterance and who will start speaking next, as well as consider a strategy for good timing to speak next. Gaze behavior plays an important role in smooth turn-changing. This article proposes a prediction model that features three processing steps to predict (I) whether turn-changing or turn-keeping will occur, (II) who will be the next speaker in turn-changing, and (III) the timing of the start of the next speaker’s utterance. For the feature values of the model, we focused on gaze transition patterns and the timing structure of eye contact between a speaker and a listener near the end of the speaker’s utterance. Gaze transition patterns provide information about the order in which gaze behavior changes. The timing structure of eye contact is defined as who looks at whom and who looks away first, the speaker or listener, when eye contact between the speaker and a listener occurs. We collected corpus data of multiparty meetings, using the data to demonstrate relationships between gaze transition patterns and timing structure and situations (I), (II), and (III). The results of our analyses indicate that the gaze transition pattern of the speaker and listener and the timing structure of eye contact have a strong association with turn-changing, the next speaker in turn-changing, and the start time of the next utterance. On the basis of the results, we constructed prediction models using the gaze transition patterns and timing structure. The gaze transition patterns were found to be useful in predicting turn-changing, the next speaker in turn-changing, and the start time of the next utterance. Contrary to expectations, we did not find that the timing structure is useful for predicting the next speaker and the start time. This study opens up new possibilities for predicting the next speaker and the timing of the next utterance using gaze transition patterns in multiparty meetings.
Year
DOI
Venue
2016
10.1145/2757284
TiiS
Keywords
Field
DocType
Turn-changing, gaze behavior, multiparty meetings, next speaker prediction, speech timing prediction
Gaze,Computer science,Utterance,Speech recognition,Speaker recognition,Artificial intelligence,Natural language processing,Speaker diarisation,Eye contact
Journal
Volume
Issue
ISSN
6
1
2160-6455
Citations 
PageRank 
References 
9
0.49
12
Authors
4
Name
Order
Citations
PageRank
Ryo Ishii115516.59
Kazuhiro Otsuka261954.15
Shiro Kumano314916.82
Junji Yamato41120165.72