Abstract | ||
---|---|---|
We describe models of Mandarin prosody that allow us to make quantitative measurements of prosodic strengths. These models use Stem-ML, which is a phenomenological model of the muscle dynamics and planning process that controls the tension of the vocal folds, and therefore the pitch of speech. Because Stem-ML describes the interactions between nearby tones, we were able to capture surface tonal variations using a highly constrained model with only one template for each lexical tone category, and a single prosodic strength per word. The model accurately reproduces the intonation of the speaker, capturing 87% of the variance of f0 with these strength parameters. The result reveals alternating metrical patterns in words, and shows that the speaker marks a hierarchy of boundaries by controlling the prosodic strength of words. The strengths we obtain are also correlated with syllable duration, mutual information and part-of-speech. |
Year | DOI | Venue |
---|---|---|
2003 | 10.1016/S0167-6393(03)00100-6 | Speech Communication |
Keywords | Field | DocType |
Intonation,Tone,Tonal variation,Prosodic structure,Metrical pattern,Prosodic strength,Prosody modeling,Muscle dynamics,Text-to-speech | Prosody,Speech synthesis,Computer science,Speech recognition,Planning process,Mutual information,Syllable,Hierarchy,Mandarin Chinese,Phenomenological model | Journal |
Volume | Issue | ISSN |
41 | 4 | 0167-6393 |
Citations | PageRank | References |
9 | 0.97 | 17 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Greg P. Kochanski | 1 | 215 | 19.97 |
Chilin Shih | 2 | 392 | 68.16 |
Hongyan Jing | 3 | 1524 | 112.18 |