Evaluating Text Normalization for Speech-Based Media Selection - Citegraph

Paper Info

Title
Evaluating Text Normalization for Speech-Based Media Selection

Abstract
In this paper, we present an approach how to evaluate text normalization for multi-lingual speech-based dialogue systems. The application of text normalization occurs within the task of music selection, which imposes several important and novel requirements on its performance. The main idea is that text normalization should determine likely user utterances from metadata that is available within a user's music collection. This is substantially different from the text preprocessing applied, for instance, in text-to-speech systems, because a) more than one normalization hypothesis may be generated, b) for media selection the information content may be reduced, which is not desirable for Text-to-speech (TTS). These factors also have an impact on evaluation.We describe an data collection effort that was carried out with the purpose of building an initial corpus of text normalization references and scorings, as well as experiments with well-known evaluation metrics from different areas of language research aiming at identifying an adequate evaluation measure.

Year	DOI	Venue
2008	10.1007/978-3-540-69369-7_7	PIT
Keywords	Field	DocType
media selection,text normalization reference,likely user utterance,normalization hypothesis,data collection effort,text normalization,speech-based media selection,adequate evaluation measure,music collection,well-known evaluation metrics,different area,information content,data collection,text to speech	Data collection,Metadata,Normalization (statistics),Computer science,Machine translation,Word error rate,Speech recognition,Preprocessor,Text normalization	Conference
Volume	ISSN	Citations
5078	0302-9743	0
PageRank	References	Authors
0.34	5	4

Authors (4 rows)

Cited by (0 rows)

References (5 rows)

Name	Order	Citations	PageRank
Martin Pfeil	1	11	1.10
Dirk Buehler	2	0	0.34
Rainer Gruhn	3	45	6.86
Wolfgang Minker	4	619	108.61

1