Title
Using conditional random fields to predict pitch accents in conversational speech
Abstract
The detection of prosodic characteristics is an important aspect of both speech synthesis and speech recognition. Correct placement of pitch accents aids in more natural sounding speech, while automatic detection of accents can contribute to better word-level recognition and better textual understanding. In this paper we investigate probabilistic, contextual, and phonological factors that influence pitch accent placement in natural, conversational speech in a sequence labeling setting. We introduce Conditional Random Fields (CRFs) to pitch accent prediction task in order to incorporate these factors efficiently in a sequence model. We demonstrate the usefulness and the incremental effect of these factors in a sequence model by performing experiments on hand labeled data from the Switchboard Corpus. Our model outperforms the baseline and previous models of pitch accent prediction on the Switch-board Corpus.
Year
DOI
Venue
2004
10.3115/1218955.1219041
ACL
Keywords
DocType
Volume
conditional random field,switch-board corpus,speech recognition,previous model,speech synthesis,pitch accent prediction,influence pitch accent placement,accent prediction task,conversational speech,pitch accents aid,sequence model
Conference
P04-1
Citations 
PageRank 
References 
30
1.88
17
Authors
2
Name
Order
Citations
PageRank
Michelle Gregory112911.35
yasemin altun22463150.46