Title
An Automatic Prosody Recognizer Using A Coupled Multi-Stream Acoustic Model And A Syntactic-Prosodic Language Model
Abstract
Automatic detection and labeling of prosodic events in speech has received much attention from speech technologists and linguists ever since the introduction of annotation standards such as ToBI. Since prosody is intricately bound to the semantics of the utterance, recognition of prosodic events is important for spoken language applications such as automatic understanding and translation of speech. Moreover, corpora labeled with prosodic markers are essential for building speech synthesizers that use data-driven approaches to generate C, natural speech. In this paper, we build a prosody recognition system that detects stress and prosodic boundaries at the word and syllable level in American English using a coupled Hidden Markov Model (CHMM) to model multiple, asynchronous acoustic feature streams and a syntactic-prosodic model that captures the relationship between the syntax of the utterance and its prosodic structure. Experiments show that the recognizer achieves about 75% agreement on stress labeling and 88% agreement on boundary labeling at the syllable level.
Year
DOI
Venue
2005
10.1109/ICASSP.2005.1415102
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING
Keywords
Field
DocType
language model,speech synthesis,stress,natural languages,automatic speech recognition,hidden markov models,speech recognition,annotation,labeling
Prosody,Speech synthesis,Computer science,Utterance,Speech recognition,Natural language,Artificial intelligence,Natural language processing,Syllable,Speech translation,Language model,Acoustic model
Conference
ISSN
Citations 
PageRank 
1520-6149
21
1.15
References 
Authors
3
2
Name
Order
Citations
PageRank
Sankaranarayanan Ananthakrishnan113413.29
Narayanan Shrikanth25558439.23