Title |
---|
Yelling at Your TV: An Analysis of Speech Recognition Errors and Subsequent User Behavior on Entertainment Systems |
Abstract |
---|
Millions of consumers issue voice queries through television-based entertainment systems such as the Comcast X1, the Amazon Fire TV, and Roku TV. Automatic speech recognition (ASR) systems are responsible for transcribing these voice queries into text to feed downstream natural language understanding modules. However, ASR is far from perfect, often producing incorrect transcriptions and forcing users to take corrective action. To better understand the impact of these errors on user sessions, this paper characterizes speech recognition errors as well as subsequent user responses. We provide both quantitative and qualitative analyses, examining the acoustic as well as lexical attributes of the utterances. This work represents, to our knowledge, the first analysis of speech recognition errors from real users on a widely deployed entertainment system. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1145/3331184.3331271 | Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval |

Keywords | Field | DocType |
---|---|---|
error characterization, intelligent agents, voice search | Computer science, Entertainment, Speech recognition | Conference |

ISBN | Citations | PageRank |
---|---|---|
978-1-4503-6172-9 | 1 | 0.36 |

References | Authors |
---|---|
0 | 3 |

Name | Order | Citations | PageRank |
---|---|---|---|
Raphael Tang | 1 | 13 | 6.71 |
Ferhan Türe | 2 | 102 | 9.45 |
Jimmy Lin | 3 | 4800 | 376.93 |