Title | ||
---|---|---|
Cluster-based approach to discriminate the user's state whether a user is embarrassed or thinking to an answer to a prompt. |
Abstract | ||
---|---|---|
Spoken dialog systems are employed in various devices to help users operate them. An advantage of a spoken dialog system is that the user can make input utterances freely, but the system sometimes makes it difficult for the user to speak to it. The system should estimate the state of a user who encounters a problem when starting a dialog and then give appropriate help before the user abandons the dialog. Based on this assumption, our research aims to construct a system which responds to a user who does not reply to the system. In this paper, we propose a method of discriminating the user’s state based on vector quantization of non-verbal information such as prosodic features, facial feature points, and gaze. The experimental results showed that the proposed method outperforms the conventional approaches and achieves a discrimination ratio of 72.0%. Then, we examined sequential discrimination for responding to the user at an appropriate time. The results indicate that, at around 6.0 s, the discrimination ratio reached a level equal to that obtained at the end of the session. |
Year | DOI | Venue |
---|---|---|
2017 | https://doi.org/10.1007/s12193-017-0238-y | J. Multimodal User Interfaces |
Keywords | Field | DocType |
Spoken dialog system,User state estimation,Audio-visual information,Sequential discrimination | Dialog box,Spoken dialog systems,Gaze,Spoken dialog,Computer science,Speech recognition,Vector quantization,Human–computer interaction,User modeling,Dialog system | Journal |
Volume | Issue | ISSN |
11 | 2 | 1783-7677 |
Citations | PageRank | References |
0 | 0.34 | 23 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yuya Chiba | 1 | 8 | 6.96 |
Takashi Nose | 2 | 399 | 39.82 |
Akinori Ito | 3 | 272 | 62.32 |