Abstract | ||
---|---|---|
In this paper, we present an approach how to evaluate text normalization for multi-lingual speech-based dialogue systems. The application of text normalization occurs within the task of music selection, which imposes several important and novel requirements on its performance. The main idea is that text normalization should determine likely user utterances from metadata that is available within a user's music collection. This is substantially different from the text preprocessing applied, for instance, in text-to-speech systems, because a) more than one normalization hypothesis may be generated, b) for media selection the information content may be reduced, which is not desirable for Text-to-speech (TTS). These factors also have an impact on evaluation.We describe an data collection effort that was carried out with the purpose of building an initial corpus of text normalization references and scorings, as well as experiments with well-known evaluation metrics from different areas of language research aiming at identifying an adequate evaluation measure. |
Year | DOI | Venue |
---|---|---|
2008 | 10.1007/978-3-540-69369-7_7 | PIT |
Keywords | Field | DocType |
media selection,text normalization reference,likely user utterance,normalization hypothesis,data collection effort,text normalization,speech-based media selection,adequate evaluation measure,music collection,well-known evaluation metrics,different area,information content,data collection,text to speech | Data collection,Metadata,Normalization (statistics),Computer science,Machine translation,Word error rate,Speech recognition,Preprocessor,Text normalization | Conference |
Volume | ISSN | Citations |
5078 | 0302-9743 | 0 |
PageRank | References | Authors |
0.34 | 5 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Martin Pfeil | 1 | 11 | 1.10 |
Dirk Buehler | 2 | 0 | 0.34 |
Rainer Gruhn | 3 | 45 | 6.86 |
Wolfgang Minker | 4 | 619 | 108.61 |