Abstract | ||
---|---|---|
We address the problem of localized error detection in Automatic Speech Recognition (ASR) output. Localized error detection seeks to identify which particular words in a user's utterance have been misrecognized. Identifying misrecognized words permits one to create targeted clarification strategies for spoken dialogue systems, allowing the system to ask clarification questions targeting the particular type of misrecognition, in contrast to the “please repeat/rephrase” strategies used in most current dialogue systems. We present results of machine learning experiments using ASR confidence scores together with prosodic and syntactic features to predict whether 1) an utterance contains an error, and 2) whether a word in a misrecognized utterance is misrecognized. We show that by adding syntactic features to the ASR features when predicting misrecognized utterances the F-measure improves by 13.3% compared to using ASR features alone. By adding syntactic and prosodic features when predicting misrecognized words F-measure improves by 40%. |
Year | DOI | Venue |
---|---|---|
2012 | 10.1109/SLT.2012.6424164 | SLT |
Keywords | Field | DocType |
speech recognition errors,localized error detection,speech recognition,spoken dialogue systems,learning (artificial intelligence),asr,machine learning,automatic speech recognition,learning artificial intelligence | Ask price,Pattern recognition,Computer science,Utterance,Error detection and correction,Speech recognition,Artificial intelligence,Natural language processing,Syntax | Conference |
ISSN | ISBN | Citations |
2639-5479 | 978-1-4673-5124-9 | 10 |
PageRank | References | Authors |
0.69 | 12 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Svetlana Stoyanchev | 1 | 104 | 13.61 |
Philipp Salletmayr | 2 | 10 | 0.69 |
Jingbo Yang | 3 | 12 | 1.92 |
Julia Hirschberg | 4 | 2982 | 448.62 |