Integrated Expression Prediction and Speech Synthesis From Text - Citegraph

Paper Info

Title
Integrated Expression Prediction and Speech Synthesis From Text

Abstract
Generating expressive, naturally sounding, speech from text using a speech synthesis (TTS) system is a highly challenging problem. However for tasks such as audiobooks it is essential if their use is to become widespread. Generating expressive speech from text can be divided into two parts: predicting expressive information from text; and synthesizing the speech with a particular expression. Traditionally these components have been studied separately. This paper proposes an integrated approach, where the training data and representation of expressive synthesis is shared across the two components. There are several advantages to this scheme including: robust handling of automatically generated expressive labels; support for a continuous representation of expressions; and joint training of the expression predictor and speech synthesizer. Synthesis experiments indicated that the proposed approach produced far more expressive speech than both a neutral TTS and one where the expression was randomly selected. The experimental results also show the advantage of a continuous expressive synthesis space over a discrete space.

Year	DOI	Venue
2014	10.1109/JSTSP.2013.2294938	IEEE Journal of Selected Topics in Signal Processing
Keywords	Field	DocType
neural network,audiobook,expressive information,speech synthesis,cluster adaptive training,integrated expression prediction,expressive speech synthesis,tts system,speech synthesizer,text analysis,expressive synthesis,speech synthesis from text,hidden markov model,expression predictor,decision trees,speech,hidden markov models,vectors	Training set,Speech synthesis,Expression (mathematics),Computer science,Speech recognition,Artificial intelligence,Natural language processing,Discrete space	Journal
Volume	Issue	ISSN
8	2	1932-4553
Citations	PageRank	References
0	0.34	19
Authors
5

Authors (5 rows)

Cited by (0 rows)

References (19 rows)

Name	Order	Citations	PageRank
Langzhou Chen	1	62	8.92
Mark J. F. Gales	2	3905	367.45
Norbert Braunschweiler	3	59	8.47
Masami Akamine	4	89	15.15
Kate Knill	5	249	28.02

1