Title
Whistler: A Trainable Text-To-Speech System
Abstract
We introduce Whistler, a trainable Text-to-Speech (TTS) system, that automatically learns the model parameters from a corpus. Both prosody parameters and concatenative speech units are derived through the use of probabilistic learning methods that have been successfully used for speech recognition. Whistler can produce synthetic speech that sounds very natural and resembles the acoustic and prosodic characteristics oi the original speaker. The underlying technologies used in Whistler can significantly facilitate the process of creating generic TTS systems for a new language, a new voice, or a new speech style.
Year
DOI
Venue
1996
10.1109/ICSLP.1996.607289
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4
Keywords
Field
DocType
speech processing,loudspeakers,natural language processing,whistler,natural languages,probability,speech recognition,speech synthesis,text to speech,learning artificial intelligence
Speech corpus,Speech processing,Speech synthesis,Speech analytics,Computer science,Chinese speech synthesis,Speech recognition,Artificial intelligence,Natural language processing,VoxForge,Speech technology,Acoustic model
Conference
Citations 
PageRank 
References 
40
8.98
7
Authors
7
Name
Order
Citations
PageRank
Xuedong Huang11390283.19
A. Acero24390478.73
Jim Adcock3408.98
Hsiao-Wuen Hon41719354.37
John Goldsmith5408.98
Jingsong Liu6408.98
mike plumpe720827.43