Title
Analysis on paralinguistic prosody control in perceptual impression space using multiple dimensional scaling
Abstract
A multi-dimensional perceptual space for communicative speech prosody was derived, using a psychometric method, from multi-dimensional impression ratings in order to characterize the paralinguistic information that prosody conveys in communication. Single-word utterances of "n" were used to avoid lexical effects and to cover as wide a range of communicative prosodic variation as possible. An analysis of daily conversations showed that conversational speech impressions were manifested in the global F0 control of "n" as differences in average height (high vs. low) and in dynamic pattern (rise, fall, gradual fall, and rise-and-fall). Using controlled single utterances of "n", multidimensional scaling (MDS) was applied to a mutual distance matrix computed from 26-dimensional vectors expressing perceptual impressions. The result revealed a three-dimensional perceptual impression space in which each dimension corresponded to a different F0 control characteristic: the positive-negative impression can be controlled by average F0 height, while the confident-doubtful or allowable-unacceptable impressions can be controlled by F0 dynamic patterns. Unlike the categorical classification of prosodic patterns common in studies of emotional prosody, this characterization allows prosodic impressions to be described flexibly and quantitatively. These results suggest that communicative prosody generation could take impression vectors as its input specification and realize them through control of average F0 height and F0 dynamic patterns. Rather than generating speech with categorical, prototypical prosody, more adequate communicative speech synthesis can be approached through such an input specification and its correspondence with control characteristics.
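The scaling step described in the abstract can be illustrated with a minimal sketch. It assumes classical (Torgerson) MDS and Euclidean distances; the abstract does not say which MDS variant or distance measure the authors used. The data are synthetic stand-ins, and the names `pairwise_distances`, `classical_mds`, and `n_stimuli` are hypothetical.

```python
import numpy as np

def pairwise_distances(X):
    """Euclidean distance matrix between the rows of X."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def classical_mds(D, n_components=3):
    """Classical (Torgerson) MDS: find coordinates whose Euclidean
    distances approximate the entries of the distance matrix D."""
    n = D.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n        # centering matrix
    B = -0.5 * J @ (D ** 2) @ J                # double-centered Gram matrix
    eigvals, eigvecs = np.linalg.eigh(B)       # eigenvalues in ascending order
    idx = np.argsort(eigvals)[::-1][:n_components]
    lam = np.clip(eigvals[idx], 0.0, None)     # guard against tiny negatives
    return eigvecs[:, idx] * np.sqrt(lam)

# Synthetic stand-in data (assumption): 12 hypothetical stimuli whose
# 26-dimensional impression vectors are driven by 3 latent factors.
rng = np.random.default_rng(0)
n_stimuli = 12
latent = rng.normal(size=(n_stimuli, 3))
loading = rng.normal(size=(3, 26))
impressions = latent @ loading                 # shape (12, 26)

D = pairwise_distances(impressions)            # mutual distance matrix
coords = classical_mds(D, n_components=3)      # 3-D perceptual space
```

Because the synthetic vectors are built from three latent factors, three dimensions reproduce the distance matrix almost exactly. With real rating data, the number of dimensions would have to be chosen by inspecting the eigenvalues or a stress measure.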
Year: 2009
DOI: 10.1016/j.specom.2007.10.006
Venue: Speech Communication
Keywords: control characterization, communicative speech synthesis, input specification, nonverbal information, paralinguistic prosody, fundamental frequency control, multiple dimensional scaling, adequate communicative speech synthesis, control characteristic, f0 control characteristic, categorical prototypical prosody, f0 dynamic pattern, f0 height, perceptual impression space, f0 control, paralinguistic prosody control, communicative prosodic variation, distance matrix, speech synthesis, fundamental frequency
Field: Prosody, Speech synthesis, Paralanguage, Expression (mathematics), Computer science, Categorical variable, Impression, Speech recognition, Emotional prosody, Perception
DocType: Journal
Volume: 51
Issue: 7
ISSN: Speech Communication
Citations: 5
PageRank: 0.90
References: 1
Authors: 5
Name                Order  Citations  PageRank
Yoko Greenberg      1      7          2.37
Nagisa Shibuya      2      5          0.90
Minoru Tsuzaki      3      118        18.89
Hiroaki Kato        4      37         18.25
Yoshinori Sagisaka  5      550        112.31