Title
Using Automatic Stress Extraction From Audio For Improved Prosody Modelling In Speech Synthesis
Abstract
Generating proper and natural sounding prosody is one of the key interests of today's speech synthesis research. An important factor in this effort is the availability of a precisely labelled speech corpus with adequate prosodic stress marking. Obtaining such a labelling constitutes a huge effort, whereas inter annotator agreement scores are usually found far below 100%. Stress marking based on phonetic transcription is an alternative, but yields even poorer quality than human annotation. Applying an automatic labelling may help overcoming these difficulties. The current paper presents an automatic approach for stress detection based purely on audio, which is used to derive an automatic, layered labelling of stress events and link them to syllables. For proof of concept, a speech corpus was extended by the output of the stress detection algorithm and a HMM-TTS system was trained with the extended corpus. Results are compared to a baseline system, trained on the same database, but with stress marking obtained from textual transcripts after 4 plying a set of linguistic rules. The evaluation includes CMOS tests and the analysis of the decision trees. Results show an overall improvement in prosodic properties of the synthesized speech. Subjective ratings reveal a voice perceived as more natural.
Year
Venue
Keywords
2015
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5
prosody analysis, stress detection, speech synthesis, prosody generation, automatic prosody labelling
Field
DocType
Citations 
Speech corpus,Decision tree,Prosody,Speech synthesis,Annotation,Phonetic transcription,Computer science,Speech recognition,Proof of concept,Artificial intelligence,Natural language processing,Baseline system
Conference
3
PageRank 
References 
Authors
0.40
5
4
Name
Order
Citations
PageRank
György Szaszák15113.21
András Beke2225.51
Gábor Olaszy35013.40
Bálint Tóth4102.31