Title
Rich prosodic information exploration on spontaneous Mandarin speech
Abstract
In this paper, rich prosodic information of spontaneous Mandarin speech is explored. The joint prosody labeling and modeling algorithm proposed previously for read speech is extended to spontaneous-speech prosody modeling by additionally considering the modeling of disfluency speech parts. It trains a hierarchical prosodic model and performs prosody labeling from a large speech corpus automatically. Rich prosodic information is then explored via analyzing model parameters and labeling results. By comparing the resulting prosodic model with that of read speech, we find that most affecting patterns, such as F0 contour patterns of 4 tones, have similar shapes or same trends but with much less dynamic ranges. Besides, the prosodic characteristics of various disfluency events, including repetition, restart, repair, contraction, and hesitation, are intensively investigated based on the labeling results. The information explored increases our knowledge about the phonology of spontaneous speech, and should be useful for assisting in ASR.
Year
DOI
Venue
2016
10.1109/ISCSLP.2016.7918367
2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)
Keywords
Field
DocType
prosodic information,prosody modeling,prosody labeling,spontaneous Mandarin speech,disfluency event
Speech corpus,Speech processing,Prosody,Pragmatics,Information exploration,Computer science,Speech recognition,Context model,Artificial intelligence,Natural language processing,Phonology,Mandarin Chinese
Conference
ISBN
Citations 
PageRank 
978-1-5090-4295-1
0
0.34
References 
Authors
0
5
Name
Order
Citations
PageRank
Cheng-Hsien Lin100.34
Chung-Long You200.34
Chen-Yu Chiang33111.55
Yih-Ru Wang423734.68
Sin-Horng Chen527339.86