Title
Evaluating Text Normalization for Speech-Based Media Selection
Abstract
In this paper, we present an approach how to evaluate text normalization for multi-lingual speech-based dialogue systems. The application of text normalization occurs within the task of music selection, which imposes several important and novel requirements on its performance. The main idea is that text normalization should determine likely user utterances from metadata that is available within a user's music collection. This is substantially different from the text preprocessing applied, for instance, in text-to-speech systems, because a) more than one normalization hypothesis may be generated, b) for media selection the information content may be reduced, which is not desirable for Text-to-speech (TTS). These factors also have an impact on evaluation.We describe an data collection effort that was carried out with the purpose of building an initial corpus of text normalization references and scorings, as well as experiments with well-known evaluation metrics from different areas of language research aiming at identifying an adequate evaluation measure.
Year
DOI
Venue
2008
10.1007/978-3-540-69369-7_7
PIT
Keywords
Field
DocType
media selection,text normalization reference,likely user utterance,normalization hypothesis,data collection effort,text normalization,speech-based media selection,adequate evaluation measure,music collection,well-known evaluation metrics,different area,information content,data collection,text to speech
Data collection,Metadata,Normalization (statistics),Computer science,Machine translation,Word error rate,Speech recognition,Preprocessor,Text normalization
Conference
Volume
ISSN
Citations 
5078
0302-9743
0
PageRank 
References 
Authors
0.34
5
4
Name
Order
Citations
PageRank
Martin Pfeil1111.10
Dirk Buehler200.34
Rainer Gruhn3456.86
Wolfgang Minker4619108.61